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SpaceX ignites big dreams 


The successful launch of the Falcon Heavy rocket was a stunning moment that opens the way for 


commercial exploration of deep space. 


hen David Bowie revealed the inspiration for his break- 
Wiese hit ‘Space Oddity’ it came as a surprise to many: 

not the Apollo missions to the Moon but the film 2001: A 
Space Odyssey (Kubrick, 1968). And it’s easy to see why: the film's ballet 
of the spinning space station and dark themes about our place in the 
Universe has a distinct aesthetic appeal. 

Does space travel still inspire? The continuing high profile of the 
International Space Station (ISS) and the welcome early steps of China 
and others aside, an extraordinary thing has happened. For much of 
the public, humanity’s adventures in space have become history, some- 
thing people used to do in the old days. (Exhibit A: the Space Shuttle, 
once a means to escape the surly bonds of Earth, is now a museum 
piece.) 

But spaceflight suddenly seems futuristic again. How else to explain 
the reaction to the spectacle of last week’s successful maiden launch of 
Falcon Heavy, the giant rocket built by Elon Musk and his company 
SpaceX. The hashtag generation has witnessed its own moment of 
inspiration writ large in the heavens. 

The launch wasnt perfect, but it still came across as a stunning syn- 
chrony of power and control. The 27 Merlin engines blazed to lift their 
giant cargo towards the sky, accompanied by cheers and whoops on the 
ground. And then, with exquisite control, the side boosters separated, 
back-flipped in tandem towards Earth and settled on landing pads at 
Florida's Cape Canaveral Air Force Station with a rapid volley of ear- 
splitting sonic booms. 

The Saturn V rocket NASA created in the 1960s for the Apollo mis- 
sions remains the most powerful rocket ever built. But since Apollo 
there has been an almost 50-year hiatus in missions to take people 
beyond low Earth orbit. The successful launch of the Falcon Heavy will 
surely mark the beginning of the end of this hiatus. It is nothing short 
of historic, and Musk deserves enormous credit. Yes it was flashy — 
gimmicky, even — but there was substance behind the style. 

Not without his faults, Musk has nevertheless shown himself to be a 
visionary with verve, can-do grit, a dash of genius and an abundance 
of hubris. SpaceX has roundly confounded many naysayers and up- 
ended the launcher industry. Tesla, Musk’s electric-vehicle company, 
faces enormous challenges, but has still helped to persuade many in 
the automotive industry that fossil-fuel engines belong in museums 
alongside the shuttle. 

Musk’s goals for space are characteristically audacious. He intends 
SpaceX’s Dragon capsule to carry people later this year, and Falcon 
Heavy’s less powerful sibling — the Falcon 9 workhorse — to ferry 
astronauts to the ISS. 

But it is the void beyond on which all sights are now set. Com- 
mercial companies have finally cracked access to deep space. Access 
to interplanetary trajectories — and the decisions about where to 
head — is no longer limited to governments. Musk wants to take 
people to Mars, which still seems a long shot — but space tourism 


suddenly feels like a more realistic prospect. 

Together, Musk and his rocket are showing what a combination 
of big ideas and big money can achieve, and so inspiring the next 
generation to dream big. Still, for all the thrill of the new, it’s worth 
remembering just how much last week’s launch — like so much in 

science — builds on the achievements of 


“Access to others. Falcon Heavy took off from the his- 
interplanetary toric Launch Complex 39A (LC-39A) at the 
trajectories Kennedy Space Center on Merritt Island in 
is no longer Florida, from where the Apollo Moon land- 
limited to ings also began their journeys. 


And although Musk’s choice of test pay- 
load — his own cherry-red Tesla Roadster 
with a mannequin at the wheel and playing ‘Space Oddity’ on aloop — 
received as much attention as the rocket that carried it, SpaceX is not 
the first to put a car into space. From LC-39A, NASA did the same. 
Then it landed its car on the Moon. And then it drove it around. Not 
bad for the old days. m 


governments.” 


Climate conflict 


Many studies that link global warming to civil 
unrest are biased and exacerbate stigma. 


drought, and the resulting water shortages could soon force 

authorities to turn off the taps. Could civil unrest follow? When 
southern Brazil experienced a similar shortage in 2015, stories cir- 
culated about the authorities running drills to prepare a response to 
desperate people attacking water infrastructure. And a study pub- 
lished last year of some 1,800 riots in sub-Saharan Africa over 20 years 
concluded that drought can indeed be a powerful contributor to civil 
disorder (C. Almer et al. J. Environ. Econ. Manage. http://doi.org/ckdj; 
2017). 

Such retrospective analyses raise two questions related to cause and 
effect: did climate change alter the weather? And did the change in the 
weather provoke the conflict? Only a solid yes to both can justify bold 
statements that global warming promotes violence — and establishing 
this answer is difficult, ifnot impossible, in many cases. 

That hasn't stopped such controversial claims being made. A decade 
ago, the United Nations went as far as to state that climate warming 
and desertification were one of the causes of the Darfur conflict in 
Sudan, which started in 2003 and led to the deaths of up to halfa 
million people over five years of revolt. That daring claim, based on 


Te people of Cape Town, South Africa, are enduring a terrible 
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sketchy information, met with harsh criticism and outright disbelief 
from researchers familiar with the region. But it also triggered growing 
interest in climate-conflict research. 

Results so far are largely ambiguous and have been frequently 
questioned by political scientists, economists, social scientists and 
climate experts, on various grounds. This week, a systematic review 
of the literature highlights one problem: efforts to find links between 
climate and social conflict are hampered by a severe sampling bias, 
including a statistically and politically dubious focus on mainly 
African countries formerly under British colonial rule. 

The study, published in Nature Climate Change, states what critics 
have long suspected: conclusions that climate change is triggering 
violent conflict cannot be generalized, and are hard to substanti- 
ate even in individual cases (C. Adams et al. Nature Clim. Change 
http://doi.org/ckfw; 2018). Researchers are drawn to regions that 
experience violence, rather than to those where climate change is 
most severe, they write. And the countries that are easiest to study — 
because of historical links, language and ease of transport — are often 
prioritized over nations that might experience more climate change or 
more violence, but are less convenient for research. (Kenya is a good 
example: it is one of the most studied countries, yet it is not near the 
top of the list in terms of either violence or climate impacts.) 

Skewed results pay a disservice to science and can undermine 
peace-keeping efforts. Climate change is never the sole cause of 
war, violence, unrest or migration. Syria and Jordan have both been 
stricken by drought this decade. But it’s clear that different social, 
political and economic factors in the two nations explain why people 
are desperate to flee from Syria and not from Jordan. 

Done correctly, climate-conflict research is certainly valuable. Asa 


global human enterprise, any science must be concerned with social 
justice and peace. Rigorous investigation into how climate change 
might affect — and perhaps violently disrupt — societies or human 
civilization at large has its place. But first, researchers in the field must 

improve their methods. 
There is a political implication to this sampling bias, too. To search 
for climate-conflict links in places where violent struggle is taking 
place, or has only recently ended — and to 


“Climate change __ pursue such research with a geographical bias 
isneverthesole towards a few, relatively accessible regions 
cause of war, in Africa — threatens to stigmatize trou- 


bled countries as being prone to even more 
instability in the future. With a view to social 
justice in science, this would be grossly unde- 
sirable. And it is a flawed approach to answering important questions 
about the socioeconomic and political conditions in which climate- 
related conflict is likely to emerge. Instead, scientists must consider 
whether peaceful responses to climate change are the norm in most 
countries. 

There is a yawning disconnect between the needs of countries in 
the developing world, many of which sit on the front line of climate 
change, and the priorities of scientists in the developed world who 
carry out most of the research. To address this, climate researchers 
must seek fresh opportunities to provide decision-makers in the devel- 
oping world with the kinds of data and projections that they most 
need — including attribution studies, which aim to assess the extent 
to which specific weather events are due to climate change. This will 
help the most vulnerable societies mobilize to adapt to climate change, 
and will offer some much-needed security. m 


violence, unrest 
or migration.” 


Personal papers 


From proposals to gripes, scientists sneak 
messages into their papers. 


o mark St Valentine's Day, Nature this week published a collection 
of stories of romance kindled and sealed by science (see 
go.nature.com/2foalrk). One describes a science writer who was 
asked to investigate unusual crystals in a particle collider, and on her 
arrival there, was surprised by her partner, who proposed; another con- 
cerns a palaeontologist who stashed an engagement ring in a stream bed. 

Then there are the declarations and proposals buried in the acknowl- 
edgements ofa scientific paper. What could be more romantic than an 
analysis of the cooling power of a fridge? Answer: an analysis of the 
cooling power of a fridge that ends with the words: Will you marry me? 

That's how Rui Long, a PhD student in engineering at Huazhong 
University of Science and Technology in Wuhan, China, proposed to 
his partner Panpan Mao, in a paper published online last month in 
Physica A: Statistical Mechanics and its Applications. 

He is not the first: a similar line in the acknowledgements of a 2015 
Current Biology paper describing a new dinosaur sent viral the pro- 
posal of Caleb Brown to his girlfriend and fellow museum scientist 
Lorna O’Brien. The proposal method has its risks: it relies, of course, 
on the person being proposed to actually reading the acknowledge- 
ments. (In at least one case, an anxious proposer had to ask his partner 
to try again.) There are other more serious concerns: that the person 
proposed to will feel coerced. Many critics argue against public propos- 
als — from those in YouTube videos to hijacked sporting events — for 
this reason. 

Proposals are certainly not the only messages that scientists 
have smuggled into their academic acknowledgements. Fund- 
ing agencies have been ‘thanked’ for steering research by refusing 
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previous applications, and scolded for not paying their bills. Sports fans 
have slipped in references to favourite teams, and imaginary people have 
been credited to pay homage to popular culture, such as The Simpsons 
TV show and, in one case, the thrash-metal band Slayer. 

Even the text of the paper is not immune. Peer reviewers, it seems, 
must be on the lookout for striking similarities to lines from Star Wars 
— and, infamously, everybody missed that an interloper had drawn a 
stick man fishing in a water tank in a schematic diagram included ina 
1955 paper in the Journal of the American Chemical Society. 

Authorship of papers is also ripe for mischief making. Physical 
Review Letters published a paper in 1973 written by the US physicist 
and mathematician Jack Hetherington and E D. C. Willard. Willard — 
who subsequently published as a sole author — was Hetherington’s cat. 
And in 2001, materials scientist Andre Geim co-authored a Physica B: 
Condensed Matter paper on Earth’s rotation with “H. A. M.S. ter Tisha’. 
(It’s not clear how the hamster contributed.) Various groups of authors 
have claimed in their papers that the order in which their names appear 
was determined by non-standard methods, including in one case, a 
25-game croquet series. 

Tinkering with the names on academic publications should not be 
undertaken lightly. South Korea announced earlier this month that it 
was widening an investigation into the possibility that some scientists 
added the names of their children and other relatives. In certain cases, 
the practice is thought to be intended to give the children an edge when 
applying to university, a highly competitive process in which, it seems, 
a publication record might help (see Nature 554, 154-155; 2018). 

How common are personal messages in papers? A straw poll of 
Nature’s manuscript editors failed to produce any confirmed examples 
in our pages. But at least one has slipped through. In an online discus- 
sion of the practice from 2011, microbiologist Rosie Redfield writes: 
“T once thanked Howard Ochman for ‘pharmacological support’ on 
a theory paper (in Nature!). He had given me a pound of excellent 
coffee beans.” We checked, and it’s true. But no more, please. As our 
guidelines to authors state: focus on the science, and avoid the risks and 
distractions of personal messages that might misfire. = 
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many PhD programmes in the biomedical sciences have short- 

ened their courses, squeezing out opportunities for putting 
research into its wider context. Consequently, most PhD curricula 
are unlikely to nurture the big thinkers and creative problem-solvers 
that society needs. 

That means students are taught every detail of a microbe’ life cycle 
but little about the life scientific. They need to be taught to recognize 
how errors can occur. Trainees should evaluate case studies derived from 
flawed real research, or use interdisciplinary detective games to find logi- 
cal fallacies in the literature. Above all, students must be shown the sci- 
entific process as it is — with its limitations and potential pitfalls as well 
as its fun side, such as serendipitous discoveries and hilarious blunders. 

This is exactly the gap that I am trying to fill at 
Johns Hopkins University in Baltimore, Mary- 
land, where a new graduate science programme 
is entering its second year. Microbiologist Arturo 
Casadevall and I began pushing for reform in 
early 2015, citing the need to put the philoso- 
phy back into the doctorate of philosophy: that 
is, the ‘Ph’ back into the PhD. We call our pro- 
gramme R3, which means that our students learn 
to apply rigour to their design and conduct of 
experiments; view their work through the lens of 
social responsibility; and to think critically, com- 
municate better, and thus improve reproducibil- 
ity. Although we are aware of many innovative 
individual courses developed along these lines, 
we are striving for more-comprehensive reform. 

Our offerings are different from others at the graduate level. We 
have critical-thinking assignments in which students analyse errors in 
reasoning in a New York Times opinion piece about ‘big sugar, and the 
ethical implications of the arguments made in a New Yorker piece by 
surgeon Atul Gawande entitled “The Mistrust of Science. Our courses 
on rigorous research, scientific integrity, logic, and mathematical and 
programming skills are integrated into students laboratory and field- 
work. Those studying the influenza virus, for example, work with real- 
life patient data sets and wrestle with the challenges of applied statistics. 

A new curriculum starts by winning allies. Both students and 
faculty members must see value in moving off the standard track. We 
used informal interviews and focus groups to identify areas in which 
students and faculty members saw gaps in their training. Recur- 
ring themes included the inability to apply theoretical knowledge in 
statistical tests in the laboratory, frequent mistakes in choosing an 
appropriate set of experimental controls, and significant difficulty 
in explaining work to non-experts. 

Introducing our programme to colleagues in the Johns Hopkins 
life-sciences departments was even more sensitive. I was startled by 
the oft-expressed opinion that scientific productivity depended more 


| J nder pressure to turn out productive lab members quickly, 


PUT THE 


PHILOSOPHY 
BACK 
INTO THE 


DOCTORATE 


PHILOSOPHY. 


Train PhD students tobe 
thinkers not just specialists 


Many doctoral curricula aim to produce narrowly focused researchers rather 
than critical thinkers. That can and must change, says Gundula Bosch. 


on rote knowledge than on competence in critical thinking. Several 
principal investigators were uneasy about students committing more 
time to less conventional forms of education. The best way to gain 
their support was coffee: we repeatedly met lab heads to understand 
their concerns. 

With the pilot so new, we could not provide data on students’ per- 
formance, but we could address faculty members’ scepticism. Some 
colleagues were apprehensive that students would take fewer courses 
in specialized content to make room for interdisciplinary courses on 
ethics, epistemology and quantitative skills. In particular, they worried 
that the R3 programme could lengthen the time required for students 
to complete their degree, leave them insufficiently knowledgeable in 
their subject areas and make them less productive in the lab. 

We made the case that better critical thinking 
and fewer mandatory discipline-specific classes 
might actually position students to be more pro- 
ductive. We convinced several professors to try 
the new system and participate in structured 
evaluations on whether R3 courses contributed 
to students’ performance. 

So far, we have built 5 new courses from scratch 
and have enrolled 85 students from nearly a dozen 
departments and divisions. The courses cover the 
anatomy of errors and misconduct in scientific 
practice and teach students how to dissect the 
scientific literature. An interdisciplinary discus- 
sion series encourages broad and critical think- 
ing about science. Our students learn to consider 
societal consequences of research advances, such 
as the ability to genetically alter sperm and eggs. 

Discussions about the bigger-picture problems of the scientific 
enterprise get students to reflect on the limits of science, and where 
science’s ability to do something competes with what scientists should 
do from a moral point of view. In addition, we have seminars and 
workshops on professional skills, particularly leadership skills through 
effective communication, teaching and mentoring. 

It is still early days for assessment. So far, however, trainees have 
repeatedly emphasized that gaining a broader perspective has been 
helpful. In future, we will collect information about the impact that 
the R3 approach has on graduates’ career choices and achievements. 

We believe that researchers who are educated more broadly will do 
science more thoughtfully, with the result that other scientists, and 
society at large, will be able to rely on this work for a better, more 
rational world. Science should strive to be self-improving, not just 
self-correcting. = 


Gundula Bosch directs the R3 Graduate Science Initiative at Johns 
Hopkins Bloomberg School of Public Health in Baltimore, Maryland. 
e-mail: gbosch@jhu.edu 
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Harassment policy 


Institutions that receive grant 
monies from the US National 
Science Foundation (NSF) 
must now inform the agency 
if they find that anyone 
funded by an NSF grant has 
committed sexual harassment. 
The policy, which the NSF 
announced on 8 February, also 
requires institutions to report 
if researchers are placed on 
administrative leave related 

to harassment allegations or 
findings, and to lay out clear 
rules defining inappropriate 
behaviour. The requirements 
will take effect after a 60-day 
public-comment period ends. 
US government research 
agencies generally do not 
require grant recipients or 
their employers to disclose 
sexual-harassment allegations 
or findings. See go.nature. 
com/2euvrrr for more. 


POLITICS 


US budget request 
Funding for the US National 
Institutes of Health and the 
National Science Foundation 
would remain flat in fiscal 
year 2019, under the 

budget request released on 
12 February by President 
Donald Trump. But his plan 
would slash funds for the 
Environmental Protection 
Agency to US$6.1 billion 

— the lowest level since the 
1990s — ending the agency’s 
climate research. The proposal 
would also gut climate 
programmes at the National 
Oceanic and Atmospheric 
Administration and cancel 
five Earth-observing missions 
or instruments at NASA. See 
page 284 for more. 


Research threat 
Philippine President Rodrigo 
Duterte has declared that 

the seas around the country 
are off-limits to foreign 


Falcon Heavy soars into the record books 


SpaceX in Hawthorne, California, launched 
the most powerful commercial rocket ever on 
6 February. Its 27-engine Falcon Heavy rocket 
lifted off from NASA's Kennedy Space Center 
in Florida and propelled founder Elon Musk’s 
electric sports car into a looping orbit around 
the Sun. Two of the rocket’s three boosters 
successfully returned to Earth and landed, 


researchers. In a press 
conference on 9 February, 
Duterte said he would order 
the navy to fire on any foreign 
vessels found extracting 


resources from the Philippines’ 


exclusive economic zone 
(EEZ). Under a 1982 United 
Nations treaty, countries 

have exclusive economic 
rights to an area extending 
200 nautical miles (370 
kilometres) from their shores, 
but foreign research vessels 
are often granted access. A 
spokesperson for Duterte said 
that future missions should 
be done by Filipinos; this 
would cut off nations that have 
previously done research in 
the area, such as China, Japan, 
South Korea and the United 
States. The Philippines’ EEZ 
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includes the seismically active 
Benham Rise, which the 
nation renamed the Philippine 
Rise last year. 


Leadership row 
More than 70 members of the 
editorial board of the Journal 
of Molecular Medicine have 
quit over a disagreement 
with the publisher about 

the appointment of anew 
editor-in-chief. The journal's 
long-time editor-in-chief, 
German pharmacologist 
Detlev Ganten, retired 

last year. Journal editors 
recommended Martin Lohse 
of the Max Delbriick Center 
for Molecular Medicine 

in Berlin as his successor. 


marking another step towards SpaceX’s goals 
of reusability. The third booster, however, 
crashed into the Atlantic Ocean when some of 
its engines did not ignite as planned. SpaceX 
streamed live images of Musk’s car as it moved 
farther and farther from Earth, until a final 
engine burn sent it ona course that will 
eventually take it out near Mars. 


But the publisher, Springer 
Nature, instead appointed Ari 
Waisman, an immunologist 
at the Johannes Gutenberg 
University of Mainz in 
Germany. (Nature’s news team 
is editorially independent 

of Springer Nature, which 
also publishes Nature.) As 

of 13 February, the Journal 

of Molecular Medicine listed 
23 remaining editorial board 
members on its website. 


Preprint server 

The American Geophysical 
Union launched a major 
preprint server for the Earth 
sciences on 8 February. The 
Earth and Space Science Open 
Archive is a free community 
site for posting pre-publication 
findings, as well as conference 
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presentations such as posters. 
A competing preprint 

server, EarthArXiv, has been 
operating since last October 
under the auspices of the 
Center for Open Science in 
Charlottesville, Virginia. They 
are the first major initiatives to 
test whether Earth scientists 
will embrace the open-posting 
model pioneered by sites such 
as the arXiv physics server. 


Gender pay gap 


Women who work at the 
Wellcome Trust — one of the 
world’s largest biomedical 
charities — earn a median 
salary that is 21% lower 

than that of their male 
colleagues, the London- 
based organization said on 

7 February. The gap is slightly 
wider than Britain's overall 
gender pay gap of 18%. 
Organizations in the United 
Kingdom that employ more 
than 250 people are now 
required by law to publish 
such information on pay. “We 
see our gender pay gap as one 
important measure of how 
much more we have to do to 
become a truly inclusive place 
to work,’ the trust said in a 
statement. 


Taiwan earthquake 
Officials have started 

recovery efforts following an 
earthquake that struck the east 
coast of Taiwan on 6 February, 


killing 17 people and 

injuring at least 285. The US 
Geological Survey measured 
the earthquake at magnitude 
6.4. The quake’s epicentre 
occurred 18 kilometres north 
of Hualien, the most severely 
affected area. It caused major 
damage to some buildings, 
including a 12-storey 
residential complex that is now 
tilted (pictured) after the first 
four floors gave way. Taiwan 
experiences frequent quakes 
because it lies close to the 
intersection of three tectonic 
plates. As Nature went to press, 


Hualien was still experiencing 
aftershocks. 


Isotope process 
On 8 February, the US Food 
and Drug Administration 
approved a new technology 
for producing a common 
medical isotope without the 
use of enriched uranium. 
Technetium-99m is used 


in roughly 80% of medical- 
imaging procedures in the 
United States, but the country 
has relied entirely on foreign 
imports for the past three 
decades. The global market has 
been rocked by shortages in 
recent years, however, caused 
in part by reactor closures 
abroad. NorthStar Medical 
Radioisotopes in Beloit, 
Wisconsin, uses naturally 
occurring molybdenum-98, 
which absorbs neutrons ina 
nuclear reactor and becomes 
molybdenum-99 — a source 
material that decays into 
technetium-99m. 


FUNDING 
Science budget cuts 


The Brazilian government 
will freeze 10% of the budget 
for its Ministry of Science, 
Technology, Innovation and 
Communications, according 
to an analysis released on 


SHAKING UP METRICS CULTURE 


TREND WATCH 


A survey of UK institutions 
reveals that many of them do 

not have policies to address the 
misuse of metrics such as impact 
factors in academic evaluations. 
The 2012 Declaration on Research 
Assessment (DORA), a global 
accord that aims to eliminate 
such misuse, urges hiring and 
grant panels to assess the quality 
of research papers rather than 
the journals in which they are 
published. Out of 96 institutions 
surveyed, 75 said that they hadn't 
signed DORA or didn’t havea 
research-metrics policy. 


Only 21 UK research organizations out of 96 said in a survey that they 
had signed the San Francisco Declaration on Research Assessment 
(DORA), which promotes the responsible use of research metrics. 


Has your research organization signed DORA? 


0) 10 20 30 40 #50 60 #70 ~~ 80 


Is your research organization considering signing DORA? 


) 10 20 30 40 50 60 #70 ~~ 80 
Number of institutions 
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6 February by the Brazilian 
Society for the Advancement 
of Science. The government 
announced an overall budget 
cut on 2 February, but it was 
several days before each 
ministry could confirm 

how much it would have 

to give up. The move will 
drain 477 million reais 
(US$145 million) from 

the 4.6 billion reais already 
authorized for the ministry 

in 2018. Science minister 
Gilberto Kassab promised not 
to apply those cuts to research 
institutes for now, in the hope 
that the freeze will be at least 
partly reversed in the coming 
months. 


PEOPLE 


Astronomy move 


The University of Turku 

in Finland terminated the 
employment contract of 
astrophysicist Christian 

Ott on 7 February. Ott 
resigned from the California 
Institute of Technology in 
Pasadena last year after the 
institution determined he 
had committed gender-based 
harassment of two graduate 
students. The University 

of Turku announced on 

1 February that it had hired 
Ott, but reversed the decision 
after astronomers inside and 
outside Finland criticized 
the appointment. Through 
his lawyer, Ott declined to 
comment publicly on the 
matter. 


Popular science 
The Royal Institution of Great 
Britain has appointed a new 
director, it announced on 

8 February. Civil engineer 
Shaun Fitzgerald will lead the 
London-based organization, 
which promotes public 
engagement in science and 

is known for its Christmas 
Lectures. His predecessor, 
gerontologist Sarah Harper, 
left the 219-year-old 
organization last September 
after less than four months in 
the job. 


> NATURE.COM 
For daily news updates see: 
WWwwW.nature.com/news 
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for 2019 leaves science 


POLICY Trump’s budget request 
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itself for flood of data from 
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QUANTUM INTERNET Physicists 
aim to harness the power 
of entanglement p.289 
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ENVIRONMENT 


Forests of ponderosa pine (Pinus ponderosa) may never recover from a megafire that burned through New Mexico’s Jemez mountains in 2011. 


Tree rings reveal wildfire 
risk for southwestern US 


Historical record points to climate patterns that prime the region for intense fire seasons. 


BY CALLY CARSWELL 


llis Margolis’s desk is covered in slices 
HE: wood. They’re scattered around his 
computer, piled in cardboard boxes and 
stacked on shelves. The tree rings and burn 
scars imprinted in the wood help the ecologist 
and other scientists predict the potential for 
future fires in the southwestern United States 
by understanding the blazes of the past. 
This year, as Margolis and his colleagues at 
the US Geological Survey (USGS) in Santa Fe, 


New Mexico, go into the field to expand their 
data set, the work feels like a race against time. 
Snowpack is dangerously low in many parts 
of Arizona and New Mexico, setting the stage 
for exceptionally dry conditions that fuel 
these conflagrations. The biggest wildfires in 
New Mexico’s recorded history have erupted 
in just the past decade, torching hundreds 
of thousands of hectares across the state, 
including in the mountains Margolis studies. 
“The lack of snow is scary,” he says. “We're set 
up for a big fire year if things don’t change 


dramatically” 

Storms that could deliver game-changing 
amounts of precipitation are unlikely, 
according to a 6 February federal water-supply 
forecast. And preliminary results from the 
network of sampling sites across two of New 
Mexico's mountain ranges suggest that future 
blazes — even as soon as this summer — could 
be even bigger than recent megafires. 

Since the late 1970s, researchers have used 
tree rings and fire scars to reconstruct the fire 
history of an area and to understand how > 
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> climate drives conflagrations. Tree growth 
rings vary in width with annual precipitation, 
providing a record of past climates. And when 
blazes burn a tree without killing it, they leave 
scars that can be dated along with the rings. 

Initially, researchers tried to understand 
how frequently fires had burned individual 
stands of trees, says Tom Swetnam, a fire 
ecologist at the University of Arizona in 
Tucson who is now based in New Mexico. 
Land-management agencies such as the US 
National Park Service and the US Forest 
Service had followed strict fire-suppression 
policies for decades, but they were beginning 
to recognize fire’s ecological benefits. The 
agencies wanted to know how fire had 
behaved historically, so they could use it as a 
tool to promote forest health. 

As scientists built fire chronologies, patterns 
emerged. Forests in Arizona, New Mexico, 
western Texas and northern Mexico all tended 
to burn in the same years, for instance. And 
Swetnam eventually linked active fire years to 
La Nifia phases, or periods of cooling in the 
Pacific Ocean's equatorial waters, that dried 
out the Southwest’. 

Multiple studies on mid-elevation forests 
of ponderosa pine (Pinus ponderosa) in 
Arizona and New Mexico found’ that before 
widespread fire suppression, low-intensity 
fires had burned through these stands roughly 
every decade. Fire helped to maintain the open 


structure of ponderosa stands, and without it 
the forests grew thick with trees. This made 
them more vulnerable to big, hot fires. 

One such blaze ignited in New Mexico's 
Jemez Mountains in June 2011, when a tree fell 
on a power line. Dubbed the Las Conchas fire, 
it torched about 63,000 hectares, making it the 
largest blaze in New Mexico’s recorded history 

at the time. It burned 


“The lack of incredibly hot, and 
snow is scary. killed so many trees 
We’re set up in certain areas that 
for a big fire scientists are unsure 
year if things whether the forest 
don’t chan ge will ever grow back. 


In some ways, the 
Las Conchas fire was 
an outlier. Its severe temperatures were unusual, 
says Craig Allen, a USGS fire ecologist based in 
the Jemez Mountains, especially in areas where 
ponderosa pines were totally incinerated. 

Yet, in other ways, fires on the scale of Las 
Conchas may be consistent with historical 
norms. For instance, Margolis says, researchers 
don't have a good grasp of whether the size of 
the blaze was truly exceptional. 

This is one of the questions that Margolis 
has pursued by systematically sampling 
trees across the Jemez Mountains, and in 
expanding a similar network in the Sangre 
de Cristo Mountains outside Santa Fe. “It’s so 
basic,’ he says, “but it takes a lot of data to get 


dramatically.” 


there.” Although his analysis of the Jemez data 
isn't complete, Margolis sees strong evidence 
that fires as big as Las Conchas, or even twice 
its size, have occurred for centuries. The 
implication: “We should be more freaked out 
that the fires can get even bigger,’ he says. 

This year, it’s the potential for fire in the 
Sangre de Cristo Mountains that really 
worries Margolis. Many of the area’s forests 
haven't burned in more than 100 years, an 
unnaturally long dormant period, according 
to fire histories he’s reconstructed. That means 
the mountains are stocked with fuel. 

Widespread fires of the past burned on the 
heels of extremely dry winters that followed 
two to three wet winters. That’s exactly the 
pattern New Mexico is currently experiencing, 
because 2016 and 2017 were relatively wet. 

“If we had a Las Conchas fire outside of 
Santa Fe, it would be devastating,” Margolis 
says. The fire itself could threaten life and 
property. The loss of vegetation would leave 
the area vulnerable to post-fire flooding that 
could wipe out roads and clog vital water 
infrastructure with debris. “We're sitting on 
this powder keg,” he says. = 


1. Swetnam, T. W. & Betancourt, J. L. Science 249, 
1017-1020 (1990). 

2. Margolis, E.Q., Huffman, D. W. & Ifiguez, J. M. 
Southwestern Mixed-Conifer Forests: Evaluating 
Reference Conditions to Guide Ecological 
Restoration Treatments. Ecological Restoration 
Institute Working Paper No. 28 (2013). 


Trump budget 
underwhelms 


Many major science agencies would see flat funding. 


BY LAUREN MORELLO, GIORGIA GUGLIELMI, 
SARA REARDON, JEFF TOLLEFSON & 
ALEXANDRA WITZE 


( "Us Pra reigned on 12 February, as 
US President Donald Trump released his 
budget request for the 2019 fiscal year. 

Just four days earlier, the Congress had lifted 
mandatory caps on government spending, 
sending the Trump administration scrambling 
at the last minute to revise its budget proposal. 

The White House abandoned its original plan 

to seek a 27% funding cut for the National 

Institutes of Health (NIH), a 29% decrease for 

the National Science Foundation (NSF) anda 

22% reduction for the Department of Energy 

(DOE) Office of Science, holding their fund- 

ing steady. But the details of Trump’s vision for 


many agencies remain fuzzy, frustrating sci- 
ence advocates. 

“The big headline is that at the eleventh hour, 
[the White House] backed away from their 
intention of dramatically scaling back on basic 
research,’ says Matt Hourihan, director of the 
research and development budget and policy 
programme at the American Association for 
the Advancement of Science in Washington 
DC. But science agencies aren't out of the woods 
yet, he warns. Even in a budget that seems to 
support basic science, “they’re still going after 
programmes, like environmental programmes, 
that they believe fall outside the purview of gov- 
ernment’, Hourihan says. 

Among other things, the Trump request 
would cut the US Environmental Protection 
Agency (EPA) budget to its lowest level since 
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the early 1990s, gut climate-change research 
at the National Oceanic and Atmospheric 
Administration (NOAA) and axe five Earth- 
observing missions and instruments at NASA. 

And although biomedical research holds 
steady under Trump’s final 2019 plan, policy 
watchers remain sceptical of the president's 
intentions. The White House may have 
reversed course on potential cuts to the NIH 
and NSE, but Trump still opposes any funding 
increases for non-military programmes, says 
Jennifer Zeitzer, director of legislative relations 
for the Federation of American Societies for 
Experimental Biology (FASEB) in Bethesda, 
Maryland. “I think itd be really generous to 
read that suddenly NIH and NSF are a priority 
for this administration,” she says. 

Trump is seeking US$34.8 billion for the 
NIH — roughly equal to the level in 2017, the 
last year for which Congress and the White 
House agreed on a final budget (see ‘Roller- 
coaster ride’). But the White House wants 
that money to go further than it does now, by 
creating three new institutes within the NIH. 
Among them would be the National Institute 
for Research on Safety and Quality, which 
would replace the $324-million Agency for 
Healthcare Research and Quality within the 
Department of Health and Human Services. 

The NSF would also see flat funding under 
the Trump plan, with a budget of $7.47 billion. 


SOURCE: AMERICAN ASSOCIATION FOR THE 


> That figure includes $2.2 billion that the White 
House added at the last minute, which it says 
would support basic scientific research, edu- 
cation programmes, upgrades to research 
facilities in Antarctica and elsewhere, and 
cross-disciplinary research activities. But it 
has not provided a more detailed explanation 
of how that money would be allocated. 

Trump’s final proposal for 2019 would boost 
spending across the NSF’s 7 research directo- 
rates by 2%, to $6.151 billion. But it would 
cut, by 56%, funding for the construction of 
research platforms — such as the agency’s suite 
of telescopes — and the acquisition of scientific 
instrumentation. The budget for that account 
would drop from $215 million in 2017 to 
$95 million in 2019. 

NASA would abandon the International 
Space Station under the Trump plan, which 
calls for the space agency to terminate its con- 
tribution to the 15-nation facility in 2024, after 
the current US commitment ends. The admin- 
istration wants NASA to explore how to turn 
space-station operations to private industry in 
2025, but the plan is unlikely to fly with many 
members of Congress. 

The proposed budget would also cancel 
the Wide-Field Infrared Survey Telescope 
(WFIRST), which is designed to hunt for exo- 
planets and dark matter. It has been planned as 
NASA’ next big astrophysics mission after the 
James Webb Space Telescope, which is due to 
launch next year. 

A recent independent review found that 
WFIRST’s cost could not be kept beneath the 
$3.2-billion cap set by NASA (see go.nature. 
com/2btsw3z), and the agency has been work- 
ing to revise the mission's design to reduce its 
price. US astronomers ranked WFIRST the 
top large mission in a 2010 survey of science 
priorities for the next decade. 


ROBERT NICKELSBERG/GETT’ 
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the White House agreed on a final funding deal. 
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Trump’s budget would boost funding for research on fossil fuels, such as coal used in this power plant. 


“Cancelling WFIRST is an unqualified disas- 
ter for the astronomical community,’ says Jon 
Morse, chief executive of the BoldlyGo Institute 
in New York City and former head of astrophys- 
ics at NASA. If the Congress does not move to 
continue the mission, “this would be the first 
time in the history of astrophysics decadal sur- 
veys dating back to the 1960s that the highest- 
priority mission wasn't accomplished”. 

Overall, NASA would receive $19.9 billion 
under the Trump plan, a 1.3% increase from 
the 2017 level. The agency's science directorate 
would receive $5.895 billion, a 2.3% increase. 

Funds for basic research at the DOE's Office 
of Science would remain flat, at $5.4 billion. 
Basic energy sciences would see a nearly 2% 


US President Donald Trump’s proposed 2019 budget would hold research funding steady at some agencies, 
but others would be in line for deep cuts compared with 2017 levels — the last year for which Congress and 


Funding would 
stay flat. 


-10 10 


-20 
Percentage change from 2017 


-30 


increase, to $1.85 billion, but the big winner 
would be advanced scientific computing, whose 
budget would rise by nearly 42%, to $899 mil- 
lion. The nuclear-fusion project ITER, under 
construction in France, would receive $75 mil- 
lion — $25 million above the 2017 level. 

For the second year in a row, the White House 
is proposing to kill the Advanced Research 
Projects Agency—Energy, which supports 
high-risk energy research. The agency received 
$306 million in 2017, but the Congress is split 
over its future. The House’s 2018 funding bill 
for the DOE would have eliminated the agency, 
whereas Senate appropriators approved a fund- 
ing boost, to $330 million. 

Climate-change programmes would take hits 
across multiple agencies. The Trump budget 
would eliminate funding for global-warming 
research at the EPA, whose overall funds would 
drop to $6.1 billion, from $8.1 billion in 2017. 
The plan would eliminate competitive grants 
for climate-change research at NOAA, and halt 
research to better understand how global warm- 
ing is affecting the Arctic. The agency’s overall 
budget would fall by 20%, to $4.6 billion. 

It is not clear how Congress will receive the 
Trump plan. So far, lawmakers have largely 
rejected the president's ongoing push for major 
cuts at federal science agencies. The 9 February 
budget deal, for example, included a provision 
to boost NIH spending by $2 billion in 2018 
— even though the bulk of the government is 
operating under a temporary spending plan 
that expires in late March. 

Michael Lubell, a physicist at the City Col- 
lege of New York, says that with the spending 
cap for 2019 now lifted, science agencies will 
be competing with other parts of the federal 
government for extra cash. That process is just 
beginning. “There are a lot of feeders at the 
trough,” Lubell says. “Time will tell? m 
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Plan to end yaws 
disease queried 


Bid fails to account for wild 
primates as hosts of infection. 


BY LUCAS LAURSEN 


lobal health officials are intensifying 
ex to eradicate yaws, a disfigur- 

ing skin disease that infects more than 
64,000 people a year in 14 African and south- 
east Asian countries. But some critics say that 
the plans could fail, because they don’t take into 
account discoveries in the past few years that 
wild primates harbour the bacterial infection. 

Public-health officials met in Geneva, 
Switzerland, last month to discuss how to 
expand the eradication programme in 6 of the 
14 countries in which yaws is endemic. But 
they did not discuss the part played by wild 
animals. “Even if this is not the main cause of 
re-emerging yaws nowadays, it would jeop- 
ardize global eradication,’ says Sascha Knauf, 
who studies neglected tropical diseases at 
the Leibniz Institute for Primate Research in 
G6ttingen, Germany. 

Five years ago, the World Health Organiza- 
tion (WHO) committed to eradicating yaws by 
2020, motivated in part by the discovery that 
it can be treated using an easy-to-administer 
oral antibiotic, azithromycin. The WHO esti- 
mated that the initiative would cost at least 
US$100 million. At the time, public-health 
officials thought that the disease occurred only 
in humans. Eradicating a disease that affects 
only people is much easier than one that also 
occurs in wild animals. 

However, Knauf reported in 2011 (ref. 1) 
and 2013 (ref. 2) that gorillas, chimpanzees, 
baboons and smaller primates in several West 
and Central African countries were infected 
with the same bacterium that causes yaws 
(Treponema pallidum pertenue). 

Epidemiologist Michael Marks at the 
London School of Hygiene and Tropical Medi- 
cine says that scientists have not yet shown that 
humans can catch the disease from primates. 
Even so, “it would be remiss not to pay atten- 
tion to it’, he says. 

The WHO is still waiting for such proof, 
says its medical officer for the disease, Kingsley 
Asiedu. In the meantime, Asiedu says, “we are 
not taking that into account, because there 
has not been proof of an epidemiological link 
between those yaws-like cases that have been 
found in primates and in humans”. m 


1. Knauf, S. et al. Vet. Pathol. 49, 292-303 (2011). 


2. Knauf, S., Liu, H. & Harper K. N. Emerg. Infect. Dis. 
19, 2058-2060 (2013). 
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BY SARAH WILD 


ata scientists in South Africa are 
D preparing to be inundated by a flood 

of information that is due to crash 
over them when the country’s biggest radio 
telescope doubles the scale of its operations 
in March. 

A terabyte-an-hour data deluge, which 
would fill more than three DVDs a minute, 
will flow from a network of radio dishes 
called the MeerKAT array. Currently consist- 
ing of 32 operational dishes, the array will 
expand to 64 next month. 

The impending flood of data is just a 
trickle compared with what will arrive 
after 2020, when international astronomers 
begin to expand MeerKAT to form part of 
the Square Kilometre Array (SKA). That 
will be the world’s largest radio telescope, 
and astronomers are trying to develop the 
expertise to handle torrents of data ahead of 
its full opening in 2026. South African data 
scientists also want to transfer their exper- 
tise to areas such as Earth observation and 
bioinformatics. 


018 


uth Africa readies 
for data deluge 


Nation seeks to retain control over its information. 


“We are building a system that empowers 
scientists, so that they can be part of pro- 
cessing the data — a system that allows the 
researchers to work with the data itself and 
work with the analytics, as if it was on their 
desktops,” says astronomer Russ Taylor, who 
divides his time between the University of 
Cape Town in South Africa and the Univer- 
sity of the Western Cape in the same city. 

The MeerKAT array is designed to collect 
relatively weak radio signals from space and 
combine them to extract more informa- 
tion. To convert it into the first phase of the 
SKA, engineers will initially add another 
136 dishes to the MeerKAT site in the North- 
ern Cape province of South Africa, and con- 
nect them to 130,000 antennas scattered 
across Western Australia. 


NUMBER CRUNCHING 

Data from the SKA will be shared with 
scientists from ten partner countries. But for 
now, South Africa is keen to retain control 
of its MeerKAT data rather than exporting 
them to other countries that already have 
data-processing infrastructure, says Taylor. 
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This is partly because distributing data is 
very expensive. “Fibre optics to connect two 
points in urban areas cost thousands of US 
dollars per mile,’ says Ugo Varetto, acting 
executive director at Australia’s Pawsey Super- 
computing Centre in Perth, which crunches 
and stores data from existing radio telescopes 
dotted over Australia. “In extreme environ- 
ments or underwater, that’s hundreds of 
thousands of dollars.” 

Astronomers also highly value the data 
produced by their telescopes and don’t want to 
send them elsewhere, says J. J. Kavelaars, group 
leader at the Canadian Astronomy Data Centre 
in Victoria, British Columbia. “All the effort 
of collecting the observations is expressed in 
those data files. Sending those data out of your 
jurisdiction is like shipping diamonds overseas 
for cutting,” he says. 

Because the astronomy data sets will be very 
large and will sit in geographically separate 
databases, scientists need to develop software 
tools to access them and bring them together 
in an efficient manner, says Mattia Vaccari, a 
data scientist at the University of the Western 
Cape. Taylor says he hopes that other African 
countries could use the same tools to develop 
their own cloud-based infrastructure. 


EXPLORING APPLICATIONS 

Within South Africa, others seek to take 
advantage of the data-crunching capabilities 
that the country is developing. These could 
be used for applications such as monitor- 
ing water resources or urbanization across 
the continent, says Val Munsami, head of 
the South African National Space Agency in 
Pretoria. 

Health-care officials would also like to get 
involved. Glaudina Loots, director of health 
innovation in the South African government's 
Department of Science and Technology, says 
that her unit plans to “piggyback” on the 
astronomy investment and data infrastruc- 
ture. “Part of that is earmarked for precision 
medicine. If you can't handle the data, and have 
to export it out of the country, then you start 
running into problems,” she says. 

“South Africa has one of the best hands in 
the game at this point,’ says Tony Beasley, an 
astronomer and head of the US National Radio 
Astronomy Observatory in Charlottesville, 
Virginia. “In terms of deployed science infra- 
structure, South Africa is way ahead.” = 
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US agency targets 
sexual harassment 


The National Science Foundation says institutions must 
disclose when grant recipients have violated policies. 


BY ALEXANDRA WITZE 


ny institution receiving grant 
Aves from the US National 

Science Foundation (NSF) must now 
inform the agency if it finds that anyone 
funded by the grant proposal has commit- 
ted sexual harassment. The policy will take 
effect once a 60-day public-comment period 
has ended. 

Until now, “we haven't had a requirement 
on universities to report a [harassment] 
finding or when they’ve put someone on 
administrative leave” during a harass- 
ment investigation, says France Cérdova, 
the NSF director. “We didn’t have the 
channel to find out what’s at the end of an 
investigation.” 

The reporting requirement comes in the 
wake of numerous sexual-harassment scan- 
dals in the sciences. It isa rare move among 
US federal research agencies, which gener- 
ally do not require grant recipients or their 
employers to disclose sexual-harassment 
allegations or findings. 

“Tt’s a big step in the right direction,” says 
Erika Marin-Spiotta, a biogeochemist at 
the University of Wisconsin—Madison who 
is co-leading a US$1.1-million initiative 
funded by the NSF to combat sexual and 
other forms of harassment in the sciences. 
But Marin-Spiotta says that agencies must 
do more to develop truly protective policies. 
Among other things, the NSF policy does not 
address what happens if an institution never 
completes an investigation. 

“At the end of the day, if the employing 
institution doesn't do its job, those who are 
affected will still be in a very difficult situ- 
ation,” says C. K. Gunsalus, who specializes 


in research integrity at the University of 
Illinois at Urbana-Champaign. 

The NSF notice, dated 8 February, is 
addressed to the heads of universities and 
colleges and other organizations that receive 
NSF funds. It requires them “to report find- 
ings of sexual harassment, or any other kind 
of harassment regarding a PI [principal 
investigator] or co/PI or any other grant 
personnel”. And it requires the institution to 
report if the PI or co-PI is placed on adminis- 
trative leave relating to a harassment finding 
or investigation. 

The notice also says that the NSF expects 
awardee organizations to lay out clear stand- 

ards for harassment- 


“If the . free workplaces, and 
employing processes by which 
institution students and others 
doesn’t doits can report prob- 
job, thosewho _lems. Workplaces are 
are affected defined to include 
will still bein conferences and 
avery difficult remote fieldwork 


sites, where students 
and young research- 
ers are often most vulnerable. The agency 
will solicit public feedback on the new rule 
in the coming weeks, through a posting in 
the Federal Register. 

Cérdova says that the burden of investi- 
gating harassment complaints typically 
rests with the institution that employs the 
person in question. The NSF accepts volun- 
tary reports through its Office of Diversity 
and Inclusion, but “we get vanishingly few 
complaints’, she says. 

Others note that institutions have 
differing policies on what constitutes sexual 
harassment; behaviour that might be > 


situation.” 
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> punished at one university — such as sexual 
relationships between faculty members and 
students — might not be at another. In 2016, 
investigators at the University of Rochester 
concluded that cognitive-sciences professor 
Florian Jaeger did not violate university policies 
on sexual harassment. A separate university- 
commissioned inves- 


tigation reported last “Grant money is 
month that Jaeger veryimportant 
had had sexual rela- to scientists 
tionships with four and their 
prospective, current institutions, 

or former students, gsothis policy 
and that these did will definitely 
not violate university help change the 
policies. culture.” 


In a response to 
that report, Jaeger said the relationships were 
consensual and that he did not sexually harass 
any students. 


UNDER SCRUTINY 

Like other federal agencies, the NSF is under 
pressure from the US Congress to strengthen 
its response to sexual harassment. In January, 
the House of Representatives’ science com- 
mittee asked the Government Accountability 
Office to look into sexual harassment involv- 
ing federally funded researchers at agencies 
including the NSE, NASA, the Department of 


Energy and the National Institutes of Health. 

The action in Congress was prompted in 
part by an investigation at Boston University 
in Massachusetts. The university found 
that one of its professors, David Marchant, 
had violated campus policies on sexual 
harassment while on NSF-funded field- 
work in Antarctica. (Marchant denies that 
he engaged in inappropriate behaviour, and 
he is appealing against the university finding, 
his lawyer says.) 

Some scientific societies have stepped up 
their policies against sexual harassment in 
recent years. Last September, the American 
Geophysical Union (AGU) changed its integ- 
rity and ethics policy to classify discrimination, 
sexual harassment and bullying as professional 
misconduct. 

The fact that a funding agency is taking 
action is crucial, says Robin Bell, a geophysicist 
at the Lamont-Doherty Earth Observatory in 
Palisades, New York, and AGU president-elect. 
“Linking reporting of harassment to funding is 
the next step the scientific enterprise can take 
to stop bullying and harassment by making the 
consequences clear,’ she says. 

Meg Urry, an astronomer at Yale University 
in New Haven, Connecticut, agrees. “T 
think it will make a big impact,” she says. 
“Grant money is very important to scien- 
tists and their institutions, so this policy will 


definitely help change the culture.” 

But much remains to be done to further 
protect those who have been harassed and 
to lessen the roadblocks to reporting, says 
Marin-Spiotta. “You could imagine a postdoc 
thinking, my advisor’s going to lose all the 
funding I need to do my work,’ she says. 

One possible solution, she says, could be to 
route funding directly to students or postdocs, 
or to their academic department, rather than 
to their principal investigator. m 


CORRECTIONS 

The News story ‘Super-invasive crayfish 
revealed to be a genetic hybrid’ (Nature 

554, 157-158; 2018) incorrectly stated that 
Julie Jones was the first to identify marbled 
crayfish in Madagascar. In fact, another team 
made the discovery; Jones and her team 
were the first to survey the species there. 

The News story ‘Israeli fossils hint at early 
migration’ (Nature 554, 15-16; 2018) gave 
the wrong URL for reference 1. It should 
have been http://dx.doi.org/10.1126/ 
science.aap8369. 

The Editorial ‘Maths revision’ (Nature 
554, 146; 2018) mistranslated the name of 
the Leibniz Institute. It is actually the Leibniz 
Institute for Information Infrastructure. 
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BY DAVIDE CASTELVECCHI 


Quantum physics can already make communications 
super-secure. But exploiting some ofits strangest 


properties could take these networks to the next level. 


Stephanie Wehner is 
part of the team trying to 
build a true quantum 
network across Europe. 


efore she became a theoretical physicist, 
Been Wehner was a hacker. Like 

most people in that arena, she taught 
herself from an early age. At 15, she spent her 
savings on her first dial-up modem, to use at 
her parents’ home in Wiirzburg, Germany. 
And by 20, she had gained enough street cred 
to land ajob in Amsterdam, at a Dutch Internet 
provider started by fellow hackers. 

A few years later, while working as a network- 
security specialist, Wehner went to university. 
There, she learnt that quantum mechanics 
offers something that today’s networks are 
sorely lacking — the potential for unhackable 
communications. Now she is turning her old 
obsession towards a new aspiration. She wants 
to reinvent the Internet. 

The ability of quantum particles to live in 
undefined states — like Schrédinger’s pro- 
verbial cat, both alive and dead — has been 
used for years to enhance data encryption. But 
Wehner, nowat Delft University of Technology 
in the Netherlands, and other researchers argue 
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that they could use quantum mechanics to do much more, by harnessing 
nature's uncanny ability to link, or entangle, distant objects, and teleport- 
ing information between them. At first, it all sounded very theoretical, 
Wehner says. Now, “one has the hope of realizing it”. 

Proponents say that such a quantum internet could open up a whole 
universe of applications that are not possible with classical communi- 
cations, including connecting quantum computers together; building 
ultra-sharp telescopes using widely separated observatories; and even 
establishing new ways of detecting gravitational waves. Some see it as 
one day displacing the Internet in its current form. “I’m personally of 
the opinion that in the future, most — if not all — communications 
will be quantum,” says physicist Anton Zeilinger at the University of 
Vienna, who led one of the first experiments 
on quantum teleportation’, in 1997. 

A team at Delft has already started to build 
the first genuine quantum network, which 
will link four cities in the Netherlands. The 
project, set to be finished in 2020, could be 
the quantum version of ARPANET, a com- 
munications network developed by the US 
military in the late 1960s that paved the way 
for today’s Internet. 

Wehner, who is involved in the effort, is also 
coordinating a larger European project, called 
the Quantum Internet Alliance, which aims to 
expand the Dutch experiment to a continental 
scale. As part of that process, she and others are trying to bring computer 
scientists, engineers and network-security experts together to help design 
the future quantum internet. 

Many technical details still need to be sorted out, and some researchers 
caution that it is too early to say exactly how much a quantum internet 
might deliver. But by thinking about security early, Wehner says that 
she hopes to avoid the vulnerabilities that the Internet inherited from 
ARPANET. “Maybe we have a chance to do it all right from the start.” 


QUANTUM KEYS 

The first proposals for quantum modes of communication date back to 
around the 1970s. Stephen Wiesner, then a young physicist at Columbia 
University in New York City, saw potential in one of the most basic prin- 
ciples of quantum mechanics: that it is impossible to measure a property 
ofa system without changing it. 

Wiesner suggested that information could be encoded in the states of 
objects such as isolated atoms, whose ‘spins’ can point up or down — like 
the 0 and 1 of classical bits — but can also be in both states simultane- 
ously. Such units of quantum information are now commonly called 
quantum bits, or qubits. Wiesner pointed out that because the properties 
ofa qubit can’t be measured without changing its state, it is also impos- 
sible to make exact copies or ‘clones’ of one’. Otherwise, someone could 
extract information about the state of the original qubit without affecting 
it, simply by measuring its clone. This prohibition later became known as 
quantum no-cloning, and it turns out to be a boon for security, because 
a hacker cannot extract quantum information without leaving a trace. 

Inspired by Wiesner, in 1984, Charles Bennett, a computer scientist at 
IBM in Yorktown Heights, New York, and his collaborator Gilles Brassard, 
at the University of Montreal in Canada, came up with an ingenious 
scheme by which two users could generate an unbreakable encryption 
key that only they know’. The scheme depends on the fact that light can 
be polarized, so that the electromagnetic waves oscillate either in a hori- 
zontal or a vertical plane. One user converts a random sequence of 1s 
and 0s into a quantum key encoded in those two polarization states and 
sends it streaming to another person. Ina sequence of steps, the recipient 
measures the key and establishes that the transmission was not disturbed 
by the measurements of an eavesdropper. Confident in the security of the 
key, the two parties can then scramble any message made up of classical 
bits — an image, for example — and send it just as they would any other 
encrypted message over the conventional Internet, or any other channel. 


290 | NATURE | VOL 554 | 15 FEBRUARY 2018 


“MAYBE WE HAVE 
A CHANCE TO DO IT 
ALL RIGHT FROM 
THE START.” 


In 1989, Bennett led the team that first demonstrated this ‘quantum 
key distribution (QKD) experimentally’. Today, QKD devices that use 
similar schemes are commercially available and typically sold to financial 
or government organizations. ID Quantique, for example, a company 
founded in 2001 in Geneva, Switzerland, built a quantum link that has 
been protecting the results of Swiss elections for more than ten years. 

Last year, China’s Micius satellite, the brainchild of physicist Pan 
Jianwei of the University of Science and Technology of China in Hefei, 
made some of the flashiest demonstrations of the approach. Using a vari- 
ant of Bennett and Brassard’s protocol, the spacecraft created two keys, 
then sent one to a ground station in Beijing and another to Vienna as it 
passed overhead. An on-board computer then combined the two secret 
keys to create a new one, which it beamed 
down classically. Armed with their private 
keys, the Vienna and Beijing teams could 
unscramble that combined key by essentially 
subtracting their own, and so learn the other's 
secret key. With both keys, one team could 
decrypt a transmission that the other team 
encrypted with its key. Last September, Pan 
and Zeilinger used this approach to set up the 
first intercontinental video chat to be secured 
in part with a quantum key’. 

Satellites such as Micius could help to 
address one of the main challenges in mak- 
ing today’s quantum communications secure: 
distance. The photons needed to create an encryption key can get 
absorbed by the atmosphere or — in the case of ground networks — by 
an optical fibre, which renders quantum transmission impractical after 
several tens of kilometres. 

Because quantum states cannot be copied, it is not an option to send 
multiple copies of a qubit in the hope that at least one will arrive. So, at 
the moment, creating long-distance QKD links requires building ‘trusted 
nodes to act as intermediaries. If a person were to hack into a trusted 
node, which handles keys in both their quantum and classical forms, 
they would be able to copy the keys without being detected — and so, of 
course, could the government or company operating the node. This is 
true both for trusted nodes on the ground and for Micius. “The satellite 
knows everything,” Pan says. But passing satellites could cut down on 
the number of trusted nodes that are needed to connect distant points. 

Pan says that trusted nodes are already a step forward for some appli- 
cations, because they reduce the number of spots where a network 
is vulnerable to attack. He has also led the creation of the extensive 
Beijing-Shanghai quantum-communication backbone. Launched in 
September, this connects 4 cities with 32 trusted nodes using more than 
2,000 kilometres of optical fibre, and is being tested for banking and com- 
mercial communications, such as linking up the data centres of Internet- 
shopping giant Alibaba, Pan says. 


QUANTUM CONNECTIONS 

But networks that involve trusted nodes are only partly quantum. 
Quantum physics plays a part only in how the nodes create the encryp- 
tion key; the subsequent encryption and transmission of information 
is entirely classical. A true quantum network would be able to harness 
entanglement and teleportation to transmit quantum information over 
long distances, without the need for vulnerable trusted nodes. 

One of the main motivations for building such networks is to enable 
quantum computers to talk to each other, both between countries and 
across a single room. The number of qubits that can be packed into 
any one computing system may be limited, so networking the systems 
together could help physicists to scale them up. “At this point, it’s fair to 
say that probably you'll be able to build a quantum computer with maybe a 
couple of hundred qubits,’ says Mikhail Lukin, a physicist at Harvard Uni- 
versity in Cambridge, Massachusetts. “But beyond that, the only way to do 
this is use this modular approach, involving quantum communications” 

Ona larger scale, researchers envision a quantum-computing cloud, 
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with a few highly sophisticated machines that are accessible through a 
quantum internet from most university labs. “The extra cool thing is that 
such cloud quantum computing is also secure,” says Ronald Hanson, an 
experimental physicist at Delft. “People at the server are unable to know 
what kind of program youre running and the data you have” 

Researchers have come up with a plethora of other proposals for 
Internet applications — such as auctions, elections, contract negotia- 
tions and speed trading — that could exploit quantum phenomena to 
be faster or more secure than their classical counterparts. 

But the biggest impact of a quantum internet could be on science itself. 
Synchronizing clocks using entanglement could improve the precision 
of Global Positioning System-like navigation networks from metres to 
millimetres, some researchers say. And Lukin and others have proposed 
using entanglement to combine distant atomic clocks into a single clock 
with vastly improved precision, which he says could lead to new ways 
of detecting gravitational waves, for example. In astronomy, quantum 
networks might link distant optical telescopes across thousands of kilo- 
metres, to effectively give them the resolution of one dish spanning the 
same distance. This process, called very long baseline interferometry, is 
applied routinely in radio astronomy, but operating in optical frequencies 
requires timing precision that is currently out of reach. 


SPOOKY SECURITY 

In the past decade or so, experiments pioneered by Christopher 
Monroe’, a physicist at the University of Maryland in College Park, and 
others have demonstrated some of the fundamentals needed to build 
a truly quantum network, such as teleporting information encoded in 
qubits from one place to another (see ‘Creating a quantum internet’). 

To see how teleportation (also proposed by Bennett and Brassard’) 
works, imagine two users: Alice and Bob. Alice holds a qubit, which 
could be a trapped ion or some other quantum system, and wants to 
transfer the information stored in it to Bob. As luck would have it, Alice 
and Bob come into possession of two ‘proxy’ particles — also qubits 
— that are entangled with each other. If Alice can entangle her qubit 
and proxy particle, the qubit will, by extension, also become entan- 
gled with Bobs particle. To do so, Alice performs a particular kind of 
joint measurement on her two particles. She then shares the results of 
that measurement (which are ordinary, classical data) with Bob. To 
complete the teleportation process, Bob then uses that information 
to manipulate his particle so that it ends up in the same state as Alice's 
qubit originally was. 

For practical purposes, it doesn’t matter how Alice and Bob obtain 
the entangled proxy particles. They could be individual atoms deliv- 
ered in a briefcase, say, or photons beamed to the pair by a third party. 
(One of Micius’s experiments last year sent entangled pairs of photons 
to two ground stations in China over a record distance of more than 
1,200 kilometres.) Alice and Bob could also entangle the qubits they 
hold, by sending photons out to interact at a third location. 

The beauty of quantum teleportation is that the quantum information 
does not technically travel along the network. The photons that do travel 
are just used to establish a link between Alice and Bob so that quantum 
information can then be transferred. If one pair of entangled photons 
fails to establish a connection, another pair will. This means that the 
quantum information is not lost if photons are. 


LINK AND REPEAT 

A quantum internet would be able to produce entanglement on demand 
between any two users. Researchers think that this will involve send- 
ing photons through both fibre-optic networks and satellite links. But 
connecting distant users will require a technology that can extend the 
reach of entanglement — relaying it from user to user and along inter- 
mediate points. 

One way in which such a quantum repeater could work was proposed 
in 2001 by Lukin and his collaborators®. In their scheme, small quantum 
computers that can store qubits and do simple operations on them are 
used to entangle a qubit at an upstream station with one downstream. 
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Researchers expect that a fully quantum 
network will need to establish entangled 
links between any two users. Quantum 
information will then be teleported from 
one to the other, transferring the 
information without transmitting it 

over the network. 


ESTABLISHING ENTANGLEMENT 
Qubit 


Entanglement 


Entanglement links distant particles, so that if one changes state, so 
does the other. Entangled quantum bits, or qubits, effectively form a 
quantum communication channel that spans any distance. 


To create a link, Alice can emit a photon towards Bob from her qubit. 
Bob does the same towards Alice. Because the photons are entangled 
with their original qubits, when they interact, Alice’s and Bob’s qubits 
become entangled, too. 


lf Alice and Bob are far away from one another or not directly connected, 
one or more quantum repeaters will be needed to establish entanglement. 
Here, one qubit in the repeater is entangled with Alice’s qubit, and the 
other with Bob’s. By performing an operation on the two qubits it holds, 
the repeater creates entanglement between Alice’s and Bob’s qubits. 


If Alice then wants to transmit quantum information over the connection, 
she can perform a particular kind of measurement on her entangled qubit 
(pink) and the qubit whose state she wants to teleport (grey). The grey 
qubit is now entangled with Alice’s other particle and with Bob’s. 


Alice then sends information about the measurement to Bob, which 
can be done over the conventional Internet. With that information, 
Bob can perform an operation on his qubit that places it in the same 
state as Alice’s second qubit was originally. The quantum state of 
Alice’s particle — its quantum information — has been teleported. 
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Part of an experiment to investigate diamond-based systems as 
quantum-internet nodes at Delft University of Technology in the Netherlands. 


Repeated application of this ‘entanglement swapping process along a path 
in a network would eventually produce entanglement between any two 
users. 

In 2015, Hanson and his collaborators showed how to build one leg of 
a network when they linked two qubits built from single-atom impurities 
in diamond crystals and separated by 1.3 kilometres’. Photons emitted 
by the two qubits travelled towards an intermediate station, where they 
then interacted, establishing entanglement. “It shows that one can really 
establish entanglement — strong, reliable entanglement — between two 
distant quantum-information processors,’ says Seth Lloyd, a physicist at 
the Massachusetts Institute of Technology in Cambridge. 

Researchers are investigating other ways to construct and manipu- 
late qubits, including using individual ions suspended in a vacuum — 
pioneered by Monroe and others — as well as systems that pair up atoms 
and photons bouncing between two mirrors inside a cavity. 

Like Hanson’s diamond system, these qubits could be used to build 
both quantum repeaters and quantum computers. Fortunately for 
people hoping to ramp up quantum communications, the requirements 
for a repeater may be less demanding than those for a fully fledged quan- 
tum computer. Iordanis Kerenidis, a quantum-computation researcher 
at the University of Paris Diderot, made this argument at a workshop 
on quantum repeaters in Seefeld, Austria, last September. “If you tell 
experimentalists that you need 1,000 qubits, they are going to laugh,” he 
said. “If you tell them you need ten — well, they laugh less.” 

The prospect of creating a quantum internet is now becoming a 
problem of systems engineering. “From an experimental point of 
view, people have demonstrated various building blocks” for quantum 
networks, says Tracy Northup, a physicist at Austria's University of 
Innsbruck whose team works on cavity qubits and is part of Wehner’s 
pan-European Quantum Internet Alliance. “But putting them together 
in one place — we all see how challenging it is,” Northup says. 
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For the moment, Wehner’s alliance is still at an early stage and looking 
for public funding as well as corporate partners. In the meantime, the 
Dutch demonstration network, led by Hanson, has been moving for- 
ward. Hanson and his colleagues have been improving the speed of their 
systems, which in the 2015 experiment entangled just 245 qubit pairs 
over the equivalent of about 9 days. Another crucial challenge has been 
to reliably convert photons from the visible wavelengths that come out 
of the diamond qubits to longer, infrared ones that can travel well along 
optical fibres; this is tricky because the new photon still has to carry 
the quantum information of the old one, but without the possibility of 
cloning it. Earlier this year, Hanson and his colleagues achieved this by 
making photons interact with a laser beam of longer wavelength”. That 
technique would enable qubits to be linked over distances of tens of 
kilometres over fibre. 

Hanson's team is now building a link between Delft and The Hague, 
a good 10 kilometres away. By 2020, the researchers hope to have 
connected up four Dutch cities, with a station at each site acting as a 
quantum repeater. If successful, the project would be the world’s first 
genuine quantum-teleportation network. The group aims to open it 
up to other teams interested in performing quantum-communications 
experiments remotely, much like IBM’s Quantum Experience, which 
allows remote users access to arudimentary quantum computer. 

The network could be a test bed for researchers hoping to fix some of 
the Internet's flaws, not least the ease with which users can forge or steal 
identities. “The idea that you could join a network without establishing 
identity is a problem from early on,’ said Robert Broberg, a network 
engineer from the telecommunications equipment giant CISCO, at 
the Seefeld meeting. Wehner and others have proposed quantum tech- 
niques that would allow users to prove their identity by certifying that 
they own the correct secret code (a series of classical bits) without ever 
transmitting it. Instead, the user and the server use the code to create 
a sequence of qubits and send them to a ‘black box’ in between. The 
black box — which could be, say, a cash machine — can then compare 
the two sequences to see whether they match, without ever knowing 
the underlying code. 

But some researchers caution against overselling the potential reach 
of the technology. “Today’s Internet will never be entirely quantum, no 
more than computers will ever be all-quantum,’ says Nicolas Gisin, a 
physicist at the University of Geneva in Switzerland and a co-founder of 
ID Quantique. And it could be that many of the things people hope to 
achieve with quantum networks could be done with more conventional 
technologies. “Sometimes, something looks like a great idea at first, and 
then it turns out to be easily achievable without a quantum effect,” says 
Norbert Liitkenhaus, a physicist at the University of Waterloo in Canada 
who is helping to develop standards for the future quantum internet. 

Time will tell whether the promise of the quantum internet will 
materialize. As far as we know, teleportation is a phenomenon that, 
although physically possible, does not occur in nature, Zeilinger says. 
“So this is really new for humanity. It might take some time.” 

Wehner’s familiarity with both physics and network security has 
made her a point of reference for people in the field. And after having 
done much work on hard-core quantum theory, she is relishing the 
opportunity to shape these future networks. “For me,’ she says, “this is 
really full circle.” m 


Davide Castelvecchi is a senior reporter for Nature in London. 
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Studies of ageing have tended to focus on lifespan rather than on years of good health. 


Find drugs that delay many 
diseases of old age 


Simply extending lifespan is not enough. We need treatments that boost resilience to 
multiple age-related diseases, argue Ilaria Bellantuono and 12 co-signatories. 


years old is expected to reach 2 billion 

worldwide, or 22% of the population. 
In 2015, it was 12% of the population. And 
for many, a longer life will mean more years 
of chronic diseases, such as arthritis, type 2 
diabetes, cancer and Alzheimer’s disease. 

In Europe, for example, the estimated 
life expectancy for women born in 2014 
is 1.6 years longer than for those born in 
2006; for men, it is 2.3 years longer. But the 


B: 2050, the number of people over 60 


women born in 2014 can expect 0.7 fewer 
years of good health, and the men zero 
(see ‘More years of what?’). As this trend 
continues, health-care and other costs are 
expected to soar. 

A class of drugs called geroprotectors might 
be able to delay the onset of concurrent age- 
related diseases (multimorbidity) and boost 
resilience. In various animal models, these 
drugs can ward off problems of the heart, 
muscles, immune system and more. And in 


2014, investigators reported the results of the 
first clinical trial of a geroprotector in people 
over 65: the drug, RAD001, boosted immune 
responses to an influenza vaccine’. 

More than 200 compounds have now been 
classified as geroprotectors. But various fac- 
tors are preventing these drugs from reach- 
ing the clinic. 

Here we set out three steps required 
to speed up translation. These have been 
distilled over the past three years by > 
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> a network of academic and industry 
scientists, clinicians and regulators called 
COST Action BM1402: MouseAGE. The 
network, which is funded by the European 
Union, has worked with other experts in 
these domains worldwide. 


MATURING FIELD 

Over the past decade, understanding of the 
physiological changes that occur as people 
age has improved a great deal. 

Common mechanisms seem to under- 
pin several age-related diseases, including 
diabetes, Parkinson's disease and Alzhei- 
mer’s. A review of more than 400 studies 
of people and animal models indicates that 
similar mechanisms underlie six conditions’. 
These can involve DNA damage, such as that 
caused by free radicals; cellular senescence (in 
which cells stop dividing and start secreting 
inflammatory factors); or inflammation and 
autophagy (the degradation of organelles, 
misfolded proteins and so on)’. 

This may explain why people over 65 
are at a higher risk than younger people of 
developing more than one disease at the 
same time. In the United States, 7 out of 10 
people over 65 with diabetes will die of heart 
disease, for instance. 

It is also becoming clear that one age- 
related disease can accelerate the onset of 
others. A 2014 study’ showed that people 
older than 75 who already had diabetes, say, 
were more than twice as likely to develop 
another disease over the next three years 
than those who were healthy at the begin- 
ning of the study. 

Also, between one-quarter and half of 
people over the age of 80 become frail. The 
accumulation of deficits makes it harder for 
them to recover from an infection, fall or 
other minor stressor*. It is unclear whether 
multimorbidity leads to frailty or vice versa, 
or whether they are independent. 

Until now, ageing research has focused 
mainly on single diseases, or on delaying 
death. This means that the fundamental 
mechanisms of ageing as targets for the treat- 
ment or prevention of several age-related 
conditions are being missed. What’s more, 
patients with multimorbidity are being 
exposed to many drugs at once, often with 
adverse effects”. 


THREE STEPS TO TRANSLATION 
Research in mice over the past few years 
suggests that it might be possible to delay 
the decline of many tissues and the onset 
of disease. Drugs such as rapamycin, 
metformin or senolytics (which remove 
senescent cells), can slow the development 
of cataracts, osteoarthritis, osteoporosis, the 
loss of muscle mass and can improve cardiac 
function”®*. 

If these are to become useful in people, 
three advances must be made. 


MORE YEARS OF WHAT? 


In Europe, men and women are living longer. 
They are also spending more years with 
chronic conditions such as diabetes, cancer 
and Alzheimer's disease. 


@ Years spent with 
chronic disease 


© Healthy years 


Men born in 2014 
are expected to 
experience three 
more years with 
chronic disease than 
those born in 2006. 


Age (years) 


as coal icin Heal 


2007 2010 2013 


60 yprercrepeerepe 
2007 2010 2013 


Agree on definitions. For drugs to be 
approved for clinical use, developers must 
first identify the indication — the condition 
that enables investigators to obtain authori- 
zation to test a drug treatment in people. 
Professional organizations, such as the 
International Association of Gerontology 
and Geriatrics, must agree on definitions 
of multimorbidity and frailty. And institu- 
tions such as the World Health Organization 
(WHO) and the US Centers for Medicare and 
Medicaid Services (CMS) need to accept this 
consensus. (The CMS issues codes that affect 
people’ ability to claim health insurance for 
the treatment of a particular condition.) 

There is no standardized definition of 
multimorbidity. Some researchers use 
the term to describe the co-occurrence of 
2 diseases, some 5, others 13 and so on. 
This makes it hard to compare studies and 
to establish which diseases have a higher 
chance of occurring together. 

Agreeing to a definition will require: 
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working out which of the 5 or 10 most 
burdensome age-related diseases, say, to 
prioritize; which of these are most likely 
to associate together; the timing of devel- 
opment of each disease with respect to the 
others in the cluster; and the identification of 
the pathways associated with their common 
pathogenesis. 

Likewise, there is no consensus on the 
definition of frailty, although clinicians 
and researchers are generally in agreement 
about its usefulness as a clinical term. Nor 
are there standardized assessments of frailty. 
Some describe a person as frail if they show 
three or more of the following: weakness, 
slowness, low levels of physical activity, 
self-reported exhaustion and unintentional 
weight loss. A more quantitative approach 
attempts to capture the accumulation of defi- 
cits (including hearing loss, low mood and 
dementia) ina ‘frailty index””®. 

Over the past three years or so, at least 
six major international societies have held 
meetings to try to encourage scientists and 
clinicians to reach a consensus on the defi- 
nition of frailty''. These have involved the 
International Association of Gerontology 
and Geriatrics and the International Acad- 
emy of Nutrition and Aging, among others. 
Such efforts should be scaled up and accel- 
erated with the leadership of a coordinating 
body such as the WHO, the US Food and 
Drug Administration (FDA) or the Euro- 
pean Medicines Agency (EMA). 

Improve animal models. There are many 
mouse models for type 2 diabetes, Alzhei- 
mer’s disease, stroke, cardiovascular diseases, 
osteoarthritis and osteoporosis”. However, 
to save costs and time, the desired pathol- 
ogy is generally induced when lab mice are 
2 to 6 months old. (The average lifespan 
for the most commonly used lab mouse is 
22 months.) Researchers might delete a key 
gene, for instance, or remove a particular 
organ (such as the ovaries) to induce a dis- 
ease (such as osteoporosis). This means that 
the effects of cellular ageing on disease pro- 
gression are rarely taken into account. 

We urge investigators to develop mouse 
models in which diseases are induced in 
aged animals. 

Some researchers have proposed models 
of multimorbidity that use certain strains in 
which the ageing process is accelerated. Mice 
that have had the DNA-damage-repair gene 
Ercc1 deleted, for example, develop dysfunc- 
tion in multiple tissues'’. Again, we recom- 
mend that such phenotypes are induced in 
older mice. 

So far, the few attempts that have been 
made to develop mice models for age-related 
multimorbidity have met many challenges; 
in some cases, genetic alterations expected 
to worsen conditions have improved them. 

It could be that researchers need to delete 
multiple genes, with one gene — or several 
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genes — linked to each specific disease. 
Ultimately, other species, such as dogs or 
baboons, might provide better models for 
multimorbidity. 

Even if the creation of mouse models for 
age-related multimorbidity proves too diffi- 
cult, mice could still be used in initial testing, 
to confirm how a candidate geroprotector 
works, to observe the effect of a drug on 
tissue dysfunction and to study individual 
diseases in isolation. 

It might be easier, however, to establish 
good mouse models of frailty. 

The application of a frailty-assessment 
tool that considers 31 parameters, from grip 
strength to body weight (but not cognition) 
showed in 2014 that C57BL/6J mice develop 
mild frailty with age”. 

To develop models of more severe frailty, 
we recommend that, just as with morbidity, 
researchers characterize transgenic mouse 
models of premature ageing. As well as strains 
in which Ercc]1 is deleted, investigators could 
study mouse models of genetic disorders that 
result in premature ageing, such as for Werner 
syndrome or Hutchinson-Gilford syndrome, 
but induce the gene deletion in animals that 
are older than 12 months. We also urge inves- 
tigators to characterize models in which the 
fundamental mechanisms of ageing, such 
as inflammation or telomere maintenance, 
are disrupted. Certain telomerase-knockout 
mice that have critically short telomeres, for 
instance, develop multiple deficits by the time 
the mice reach 6-12 months old. 

In all such efforts, we encourage research- 
ers to try to assess frailty in response to 
clinically relevant stressors, such as immo- 
bilization, surgery or the administration of 
chemotherapy or radiotherapy. 

Agree on desired metrics. Assessing 
geroprotectors in preclinical trials and in 
the clinic requires that investigators estab- 
lish which attributes to measure in model 
organisms and in patients. 

We recommend that investigators focus 


Children visit a resident of a Japanese nursing home. 
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first on how to assess drugs effects on frailty, 
which will enable the design of clinical trials 
that are short and cost-effective. We also urge 
researchers to prioritize measurements that 
signal a tangible benefit for patients, because 
this will facilitate the regulatory approval 
required to test geroprotectors in the clinic. 
For example, measuring a persons ability to 
walk for 400 metres is preferable to measur- 
ing muscle mass because it improves her or 
his ability to live independently. Researchers 
should then identify the best correlates for 
these measurements in their study organisms. 

Various tools provide a starting point for 
monitoring resilience in frail elderly peo- 
ple. These include qualitative assessments 
obtained by asking individuals about their 
levels of fatigue, pain, weight loss and so on, 
or tests that monitor mobility such as gait 
speed or how long it takes a person to stand 
up and walk a certain distance. Regulatory 
authorities are especially likely to accept 
‘tmprovement in mobility’ as a target. Mobil- 
ity is a good predictor of disability, time 
spent in hospital, mortality and health-care 
expenditure”. It is easy to measure and its 
relationship to clinically meaningful change 
is well understood. 

We also advocate the development of 
better metrics of frailty in animals. Meas- 
urements that capture the health of the 
cardiovascular, cognitive, metabolic, neuro- 
muscular and musculoskeletal, and immu- 
nological systems should be made, along with 
other assessments of weight loss, responses 
to stressors and endurance, for example. 
Genomics and proteomics could be used to 
identify biomarkers for monitoring the early 
effects of interventions. 

All these advances will require 
infrastructures that enable scientists to work 
collaboratively using standardized meth- 
odologies. An example of the kind of thing 
that’s needed is INFRAFRONTIER, an EU- 
funded research network of interconnected 
mouse clinics that has been developed over 


the past ten years for those working on 
mouse genomes. 


IMMEDIATE ACTION 

Proof-of-concept clinical studies could 
demonstrate the value of geroprotectors as 
boosters of resilience in frail patients within 
the next decade. In fact, we ask industry, 
academic scientists and regulatory bodies, 
such as the FDA and EMA, to work together 
to pursue preclinical testing and clinical tri- 
als in frail patients with conditions such as 
chronic obstructive pulmonary disease, hip 
fractures and cancer immediately — even 
before standardized definitions of frailty and 
multimorbidity are thrashed out. 

Instead of waiting ten years or so for mul- 
tiple diseases to develop, researchers (using 
metrics such as mobility and time in hospital) 
might be able to demonstrate an effect within 
a few weeks. If successful, such studies could 
catalyse efforts to advance definitions, animal 
models and the characterization of measur- 
able outcomes against which to test the drugs. 

Many factors have stopped discover- 
ies about geroprotectors from changing 
patients lives. These include: the need for 
many stakeholders to work together; indus- 
try’s tendency to focus on the short term; 
and the way researchers’ performance is 
evaluated (numbers of papers published 
carries more weight than time spent help- 
ing the community establish definitions, for 
instance). With an ever-increasing ageing 
population, and the social and health-care 
systems of many nations close to crisis point, 
we must take a different approach. = 


Ilaria Bellantuono is a professor at the 
MRC-Arthritis Research UK Centre for 
Integrated research into Musculoskeletal 
Ageing, University of Sheffield, UK. 
e-mail: i.bellantuono@shef.ac.uk 
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A full list of co-signatories accompanies this 
article online (see go.nature.com/2bkouvk). 
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A1.8-million-year-old skull discovered in Dmanisi, Georgia, is one of the oldest hominin fossils found outside Africa. 


PALAEOANTHROPOLOGY 


Back to the present 


Maria Martinon-Torres weighs up an original study on human origins and evolution. 


any people assume that palaeo- 
anthropology deals only with 
the past. The thinking goes that, 


beyond a curious, somehow romantic inter- 
est in the early accounts of our ancestors, 
there is not much that this discipline can 
add to the understanding of present-day 
humans. South Korean palaeoanthropolo- 
gist Sang-Hee Lee disputes that view in Close 
Encounters with Humankind. She shows us 
to ourselves as the living (and, importantly, 
still changing) outcome of a wonderful inter- 
play between biology and natural selection 
over the roughly 6 million years since homi- 
nins diverged from the chimpanzee lineage. 

Avoiding the usual narrative, from 
bipedal ape-like creature to complex behav- 
iour, Lee offers an original journey along 
our singular evolutionary path. When did 
our ancestors lose their fur? Did the taste 
for meat change our destiny? Was farming 
a blessing or a curse? Is altruism unique to 
us? Succinctly and engagingly, Lee revisits 
these and other key questions about the 
story of our evolving species — and gives 
some unconventional answers. 


Notably, she sup- 
ports multiregion- 
alism. This is the 
theory that modern 
humans originated 
in many places 
simultaneously, in 
contrast to the ‘out 
of Africa model that 
posits a single ori- 
gin for our species. 
Thus, she counters 
the sometimes rigid 
interpretations of 
the fossil record 
propounded ina 
literature domi- 
nated by the English language and the 
Western scientific community. In her book, 
Asia makes a comeback as a birthplace of 
modern humans and their ancestors. Lee 
reminds us that the Dmanisi hominin fossils 
from the republic of Georgia are as old as the 
earliest Homo fossils found in Africa; and 
that Homo erectus might have originated in 
Asia and migrated “back into Africa” to give 


Close Encounters 
with Humankind: A 
Paleoanthropologist 
Investigates Our 
Evolving Species 
SANG-HEE LEE 

W. W. Norton: 2018. 
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rise to later Homo species. She also discusses 
the Denisovans, the mysterious hominins 
that coexisted with modern humans and 
left behind extensive DNA, but few fossils. 
She refers to them as “Asian Neanderthals” 
to highlight how the reconstruction of 
European hominins’ evolutionary story 
should not be disconnected from that of 
their Asian cousins. 

Not everything in Close Encounters with 
Humankind is about the past. Are humans 
still evolving? It’s commonly thought that 
our interaction with the world through 
culture and technology (such as clothes, 
tools or medicines) has buffered the pres- 
sure on our bodies to adapt biologically to 
the environment. Lee challenges this view 
and traces a cascade of other evidence for 
ongoing human evolution. She points to 
studies on skin colour as evidence. 

Dark skin is thought to have evolved 
in the first furless hominins in Africa, to 
protect against the ultraviolet radiation in 
intense direct sunlight. Hominins living in 
higher latitudes, went this line of reason- 
ing, would be exposed to less UV radiation, 


VALERIE KUYPERS/EPA/REX/SHUTTERSTOCK 


GRANGER/REX/SHUTTERSTOCK 


and so would need less-active melanocytes 
(the cells that produce the pigment mela- 
nin). That might largely explain the lighter 
skin of populations in regions farther from 
the Equator. However, studies by geneti- 
cist Iain Mathieson, now at the University 
of Pennsylvania in Philadelphia, and his 
colleagues on a large ancient-DNA sam- 
ple from western Eurasian populations 
revealed that the light skin of Europeans 
is due to a new gene variant that emerged 
no more than 4,000 years ago (I. Mathieson 
et al. Nature 528, 499-503; 2015). They link 
these populations’ lighter skin to the rise of 
agriculture and sedentary communal life- 
styles, a view Lee favours. 

As she shows, the shift to agriculture led 
to a diet based on processed grains and 
starches, which is deficient in many nutri- 
ents, including 
vitamin D. This 
deficiency forces 
the body itself to 
synthesize the vita- 
min — a metabolic 
process requiring 
the absorption of 
UV through the 
skin. The mutation 
for paler skin in 
Europeans pinpointed by Mathieson would 
maximize the UV absorption in popula- 
tions facing low vitamin D intake. With this 
example, Lee emphasizes how culture — in 
this case, agriculture and a change in diet — 
might even have accelerated evolution. 

Farming also led to a population 
explosion, despite increased vulnerability 
to infectious disease in settled communi- 
ties. The availability of cereals allowed earlier 
weaning of infants, and meant women could 
give birth at shorter intervals. The resulting 
population increase brought higher genetic 
diversity, “the raw material of evolution”. 
Another demonstration of how our biology 
is still subject to change is the lactase muta- 
tion that has allowed some humans over at 
least the past 5,000 years to digest milk into 
adulthood. This eccentricity, less common 
in East Asia (predominantly China), became 
a key advantage for pastoralists and might 
represent an additional mechanism for over- 
coming the scarcity of vitamin D, because 
cow’s milk is rich in the nutrient. 

Moreover, living in communities is central 
to our species’ success. As Lee notes, large 
groups became essential to survival because 
they offer assistance, to offset the difficul- 
ties of giving birth to big-brained babies 
and caring for them through a long infancy. 
Modern humans are also the longest-living 
primate species: three generations can over- 
lap in time. Individuals stay ‘useful’ beyond 
their reproductive period by taking care of 
their children’s offspring and even unre- 
lated infants. As Lee states, the concept of 


N SBS oo Se eee 


The site near Beijing where the 750,000-year-old ‘Peking Man’ 


“fictive kin” (close bonds with those outside 
family or marriage) is unique to humans. She 
notes the remains of an elderly hominin in 
Dmanisi, dated to 1.8 million years ago, that 
evidently survived for some time without 
teeth, at a time without sophisticated tools 
or the knowledge of how to control fire. That 
could indicate that the hominin was treated 
with compassion by the group: the fossil 
could be the earliest evidence of human 
altruistic behaviour. 

Lee’s style is breezy. A chapter entitled 
‘King Kong’ discusses Gigantopithecus, 
the puzzling gigantic ape found in China 
that might have coexisted with Homo 
erectus from 1.2 million to 300,000 years 
ago. ‘Breaking Back looks at back pain as 
a trade-off of bipedalism. That accessibil- 
ity sometimes risks over-simplifying, and 
occasionally strays into territory where 
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Homo erectus fossils were found. 


every trait seems to have a function or to 
have evolved for a use. 

Yet, ultimately, Lee will inspire even 
experts with her efforts at elucidating a 
field often seen as arid and inscrutable. 
Close Encounters with Humankind empha- 
sizes how much the past matters. Our 
6-million-year story has been massively 
shaped by chance and a changing environ- 
ment. Lee shows that, now more than ever, 
our decisions can shape the future of Earth 
and its inhabitants, including ourselves. m 


Maria Martinon-Torres is a 
palaeoanthropologist, director of the 
National Research Centre on Human 
Evolution (CENIEH) in Burgos, Spain, and 
an honorary reader at University College 
London. 

e-mail: maria.martinon@cenieh.es 
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Lightbox images showing the development of a nematode worm (Caenorhabditis elegans), one of the complex cellular systems studied at the Crick Institute. 


The quest for order 


Philip Ball tours a cross-disciplinary exhibition exploring patterns in bioscience. 


landed next to St Pancras railway 
station, London’s gateway to Europe. 
From the start, the biomedical powerhouse 
that is the Francis Crick Institute set out 
to put its stamp on a district already home 
to University College London, the Wellcome 
Trust and Nature. But how to carve out 
a cultural niche in an area buzzing with 
scientific and artistic activity? 
Deconstructing Patterns is part of that 
endeavour. This collaboration between the 
Crick’s scientists, established artists and 
young people from a community arts pro- 
gramme has created works inspired by what's 
going on in the institute's labs. As curator 
Bryony Benge-Abbott told me, the theme 
of patterns was chosen because a quest for 
order and regularity underlies the Crick’s 
research into various extremely complex 
systems. These range from genome- 
sequence analysis to Drosophila neurology 
and the development of the nematode worm 
Caenorhabditis elegans. 
It’s not clear how much of that comes 
across in the exhibits. A discussion of, say, 
the role of transcription factors or forces 


E 2016, a steel-and-glass spaceship 
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in myosin-actin Deconstructing 


networks in guiding Patterns 
development presents Francis Crick Institute, 
London. 


a communication 
challenge, no matter 
how beautifully illus- 
trated with micrographs of stained cells. But 
the displays’ pedagogical ambitions are also 
undermined for the right reason: artists have 
other agendas. However laudable the efforts 
to suggest that artists and scientists share 
a goal of making sense of life’s complexity, 
this exhibition reveals that they ask different 
questions and find different answers. 

This is most evident in the collaboration 
between the C. elegans team in the Polarity 
and Patterning Networks Laboratory, led by 
Nate Goehring, and young filmmakers from 
the 1A Arts project. This partnership began 
last year with a one-day workshop in which 
the scientists shared their work with a group 
of teenagers aged 14 to 17, some without 
a science background. “Our aim was to get 
them thinking about how animals develop, 
and expose them to some key concepts,” says 
Goehring. “That is, symmetry and asymme- 
try, the importance of breaking symmetry to 


Until 1 December. 


2018 


generate different types of cell and specifying 
geometric axes, the need to assign differ- 
ent cell identities, and how simple spatial 
rules can give rise to complex form?” The 
researchers devised exercises using building 
blocks connected in different ways to illus- 
trate how morphology and symmetry can 
arise from local, algorithmic rules. 

What the 1A Arts team came up with 
was quite different: a short, metaphori- 
cal narrative film about the transformative 
experience of an office worker caught in a 
repetitive regime. Despite the absence of 
explicit scientific content, Goehring was 
pleased with the result. “It was great to see 
them take advantage of their artistic freedom 
to come up with something entirely their 
own, he says, adding that the young people 
nevertheless “clearly internalized the core 
concepts — the importance of disrupting 
uniformity, defining identity, rules” In that 
regard, he says that he sees aspects of the 
principles of developmental morphology 
reflected in the film. 

Deconstructing Patterns showcases two 
other collaborations. In the more evocative 
of these, Australian artist Helen Pynor — a 
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biology graduate whose work explores, 
often viscerally, themes relating to living 
matter such as human organs — filmed 
Crick researcher Iris Salecker describing the 
development of the Drosophila neural cir- 
cuitry for vision. Only Salecker’s hands and 
arms are visible, and there is no soundtrack to 
explain what we're seeing. This silent, manual 
choreography is mesmerizingly beautiful. It 
also captures the eloquence and sophistica- 
tion of the unconscious gesture — and the 
limitations of language alone for expressing 
scientific ideas. 

“The scientific story is about the 
movement of things in space, and hard to 
convey in words,’ says Pynor. Salecker’s use of 
body and gesture, Pynor adds, “fills in some 
of the gaps, even metaphorically, between 
verbal language and spatial meaning”. View- 
ers learn nothing about fruit-fly vision from 
this film, but they do learn a great deal about 
the roles of spatial conceptualization and 
the sense of dynamic process involved in 
thinking about this complicated system. 

As Pynor notes, science does not take 
place in some abstracted universe where 
the observers have a disinterested distance 
from their subject. Instead, as she puts it, 
“the love of scientists for their subject, the 
presence of their own bodies in dialogue 
with the bodies they study, and ... their aes- 
thetic sensitivity, are all layered into their 
research”. Salecker, meanwhile, says the 
film revealed that her team’s research is “in 
essence a 3D dynamic structural problem, 
and that we are limited by our imagery and 
language. This insight will influence how 
we present data in the future.” 

The exhibition also brought together 
award-winning poet Sarah Howe, sound artist 
Chu-Li Shewring and Greg Elgar, former 
head of the Crick’s Advanced Sequencing 
team. The team’s quest to “find meaning in 
noise” had an obvious appeal to Shewring. 
Her installation allows viewers to experience 
a sound composition within acoustic “hoods” 
that are suspended from the ceiling; included 
is a reading of Howe's poem ‘A New Music, 
inspired by conversations with Elgar. 

“Alphabet of us, cipher deciding the 
exact moment genes flip on and where’, 
Howe’s poem begins. The words, fugitive 
in Shewring’s soundscape, suggest that 
representations of the genome are moving 
beyond the static, deterministic metaphors 
ofa ‘book oflife. Shewring was keen to keep 
interpretation open and uncertain, aptly 
reflecting the dance between prescrip- 
tiveness and contingency in the dynamic 
unfolding of genetic information. As a first 
step towards a more sophisticated public 
image of the genome, I'll settle for that. m 


Philip Ball is a writer in London, and 
author of Patterns in Nature. 
e-mail: p.ball@btinternet.com 
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Books in brief 


How to Fix the Future 

Andrew Keen ATLANTIC (2018) 

Silicon Valley insider Andrew Keen has long been calling out the 
“disturbingly centralized, unequal, and creepy” aspects of the digital 
revolution. In this acerbic, articulate global survey of human-centred 
solutions, he examines best practice in consumer choice, education, 
innovation, regulation and social responsibility. His journey takes 

in digital investor John Borthwick’s call for antitrust regulation and 
“human-centric design”; lessons from China, Estonia and Singapore 
in how, or how not, to digitize; and distinct signs of cognitive clarity 
in no-tech schooling. An invigorating mix of principle and vision. 


A Longing for Wide and Unknown Things 

Maren Meinhardt Hurst (2018) 

The heroic narrative is a poor fit for Alexander von Humboldt (1769- 
1859), argues Maren Meinhardt in her subtle biography. An editor 

at The Times Literary Supplement, Meinhardt paints the polymath as 
a creature of “contradictions and ambiguous achievements” firmly 
rooted in German Romanticism. Ever striving for the top, whether 
climbing Ecuador’s Chimborazo mountain or writing Cosmos (1845), 
he often found his reach exceeding his grasp; even his personal life 
seems oddly indeterminate. An intriguing companion read to Andrea 
Wulf’s Invention of Nature (Knopf, 2015; see Nature 525, 31; 2015). 


The Rise of Yeast 

Nicholas P Money OXFORD UNIVERSITY PRESS (2018) 

Yeasts are firmly embedded in the substrate of human culture. The 
sugar fungus Saccharomyces cerevisiae is, of course, central to the 
making of wine, beer, bread and biofuels; and other members of 
this unicellular eukaryotic clan are used as ‘lab rats’ in molecular 
genetics, or help produce drugs such as the antimalarial artemisinin. 
They’re all over (and inside) us, too. Botanist Nicholas Money’s 
effervescent tour is packed with delights, such as illustrations by 
Charles Tulasne, the “Audubon of fungi”, or the revelation that yeasts 
and humans share a common ancestor (and hundreds of genes). 


Lyme: The First Epidemic of Climate Change 

Mary Beth Pfeiffer ISLAND (2018) 

“This is an illness that has been minimized, underestimated, and 
politicized.” Thus says investigative reporter Mary Beth Pfeiffer on 
Lyme, the tick-borne disease now on the march in North America, 
Europe and Asia. As Pfeiffer’s hard-hitting study reminds us, non- 
specific symptoms and other complexities make tackling Lyme a 
formidable challenge (see also J. G. Logan Nature 552, 174; 2017). She 
nimbly interweaves numerous strands of research — into the influence 
of climate change on the Lyme invasion, the disease, the pathogen, the 
vectors and the harrowing impacts borne by some sufferers. 


The Missing Ingredient 

Jenny Linford PARTICULAR (2018) 

Time is the key ingredient in the culinary lab, argues Jenny Linford, 
cleverly reframing every step of the ‘food chain’ as poised on the 
clock’s tick. From this perspective, we see classic veal stock as a 
24-hour marathon, perfectly roasted coffee as a 4-minute drama and 
exquisitely fresh, consummately cooked sole as a mad sprint from 
boat to plate. Linford hardly draws breath as she zips from the time it 
takes for grated wasabi to develop its piquant kick (5 minutes) to the 
moment sauerkraut reaches peak fermentation. Barbara Kiser 
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Don’t let triage put a 
gloss on extinctions 


Conservation triage, the 
prioritization of conservation 
efforts by explicit economic 
accounting, may not be used 
to determine listing decisions 
under the US Endangered 
Species Act of 1973. This could 
change with the proposed US 
H.R.717 Listing Reform Act that 
has been submitted to Congress 
(see also go.nature.com/2e6s8fo 
and go.nature.com/2bftgd4). 
The bill proposes to “preclude 
the listing of a species as 
threatened due to the likelihood 
of significant, cumulative 
economic effects that would 
result from such listing or from 
the likely resulting designation 
of critical habitat of the species”. 
Economic effects include those 
relating to public and private 
lands, property values, the 
provision of public services, 
employment and revenues 
available for governments. This 
triage process would effectively 
legitimize species extinctions 
by ruling out conservation 
programmes that conflict with 
economic interests. 
Conservation triage puts 
a mathematical gloss on 
extinctions, presenting them as 
neutral outputs of optimization 
algorithms and branding itself 
as effective science that is 
based on data and not dogma. 
It is crucial to oppose the ‘new 
conservation’ paradigm: it is 
performing a conceptual triage 
on conservation itself. 
Guillaume Chapron Swedish 
University of Agricultural 
Sciences, Uppsala, Sweden. 
Yaffa Epstein Uppsala University, 
Sweden. 
José Vicente Lépez-Bao Oviedo 
University, Spain. 
guillaume.chapron@slu.se 


Pakistan heading for 
groundwater crisis 
The United States, China, India, 


Iran and Pakistan together 
account for more than 60% of 


the groundwater that is removed 
worldwide each year. Pakistan 
alone withdraws 6.6%, amounting 
to 65 cubic kilometres annually — 
10km? more than is replaced each 
year by natural processes. 
Pakistan, as a leading exporter 
of water-intensive crops such 
as rice, uses more global 
groundwater (29%, based on 
2010 estimates) for agricultural 
exports than any other country 
(C. Dalin et al. Nature 543, 700- 
704; 2017). It recycles just 1.2% 
of its urban wastewater, whereas 
China and India reuse 71% 
and 22% of theirs, respectively 
(A. L. Thebo et al. Environ. 
Res, Lett. 12, 074008; 2017). 
Wastewater that is dumped 
into natural streams or used in 
irrigation ultimately finds its way 
into groundwater. 
To improve management 
of Pakistan's groundwater and 
avert a crisis, research is urgently 
needed into the impact of these 
practices on the quality and 
quantity of groundwater. 
Muhammad Arif Watto, 
Safdar Bashir, Nabeel Khan 
Niazi University of Agriculture 
Faisalabad, Pakistan. 
arifwattoo@ymail.com 


Add societal impact 
to the syllabus 


We agree that societal impact 
should be rated more highly 
in scientific publishing and 
research evaluation (Nature 553, 
5; 2018). To this end, we suggest 
that ways to achieve it should 
be introduced as an important 
component of curricula at 
higher-education institutions. 
Degree theses and university 
classes in academic publishing 
are generally structured with 
bibliometric output in mind 
because that is the main driver 
of tenure and promotion. 
They rarely touch on the 
merits of societal impact. This 
monotheistic evaluation of 
academic pursuits means that 
few faculty members make 
time for public outreach. 
Even new journals for applied 
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excellence address a symptom 
of insufficient societal impact in 
research, not the cause. 

Instead of relying solely on 
papers and citations as proxies 
for impact, funders and research 
organizations need to broaden 
their assessments of scientific 
output. They should also 
acknowledge other important 
outputs, such as developments in 
scientific products and services, 
important data sets, platforms 
and software, as well as their 
influence on policy. 

Sascha Friesike Vrije Universiteit 
Amsterdam, the Netherlands. 
Benedikt Fecher Alexander von 
Humboldt Institute for Internet 
and Society, Berlin, Germany. 
Gert G. Wagner Max 

Planck Institute for Human 
Development, Berlin, Germany. 
fecher@hiig.de 


Maximize impact of 
Earth observations 


To address challenges such as 
climate change, disaster resilience 
and the security of water and 
food, it is crucial for countries, 
organizations and individuals to 
work together to coordinate Earth 
observations. They need to ensure 
that these are sustained, and share 
data to inform decision-making 
(see, for example, G. Huadong 
Nature 554, 25-27; 2018). As 
secretariat director of the Group 
on Earth Observations (GEO), 

I can vouch for the group’s 
progress in realizing the global 
Earth observatory envisaged by 
Markku Kulmala (see Nature 553, 
21-23; 2018). 

Over the past decade, the 
GEO community of 105 national 
governments and 118 partners 
has been building the Global 
Earth Observation System of 
Systems (GEOSS). Collaboration 
on more than 70 targeted 
activities is under way in the 
GEO programme (see go.nature. 
com/2nvvgo6). This coordinates 
and integrates observations 
from space-based and in situ 
platforms, fills data gaps, and 
increases capacity across a range 


of geographical, thematic and 
cross-cutting areas (including 
several that Kulmala mentions). 
GEOSS has already brokered 
more than 400 million open- 
data and information resources 
from some 5,000 providers 
(www.geoportal.org). 

We welcome extra partners 
from countries, international 
organizations, research institutes, 
academia and the private sector 
to join us in this endeavour and 
in helping to fulfil the actions and 
resources for which Kulmala calls. 
Barbara J. Ryan Group on Earth 
Observations (GEO), Geneva, 
Switzerland. 
bryan@geosec.org 


Japanese photo of 
Rosalind Franklin 


I happened upon a snapshot 

of Rosalind Franklin — whose 
landmark papers in Nature 
contributed to the unravelling 
of DNAs structure in 1953 — in 
a Japanese journal of carbon 
science (see S. Sonoda Tanso 3, 
133-136; 1953). It seems fitting 
to have rediscovered it this year, 
which marks 60 years since 
Franklin’s death at the age of 37. 

Franklin’s earlier work on the 
structure of graphite was known 
worldwide in the carbon field 
(for a review, see P. J. E Harris 
Interdiscip. Sci. Rev. 26, 204-210 
2001). Japanese carbon researcher 
Susumu Sonoda visited her 
in London in 1953, after she 
had completed her two-year 
study of DNA at King’s College 
London and had moved on to 
investigate tobacco mosaic virus 
at Birkbeck College, London. He 
was travelling around Europe 
and the United States to interact 
and exchange ideas with fellow 
carbon researchers. 

Sonoda discussed the problems 
of carbon graphitization with 
Franklin. With her permission, he 
then took a photo (see go.nature. 
com/2eeg2tx) to illustrate his 
report for Tanso. 

Yoshihiko Watanabe Kyoto, 
Japan. 
watanabe_yosh@yahoo.co.jp 
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SPACE PHYSICS 


The origin of pulsating auroras 


Spectacular light shows in Earth’s atmosphere called pulsating auroras are directly linked to processes in space. After 
decades of research, the full chain of events that creates such auroras has been observed. SEE LETTER P.337 


ALLISON N. JAYNES 


he Northern and Southern lights, also 

known as auroras, are as varied as the 

colours they display in the night sky. 
Discrete auroras are the kind that typically 
grace our desktops and calendar covers, and 
that are produced a few thousand kilometres 
above Earth’s surface. By contrast, pulsating 
auroras that are created tens of thousands of 
kilometres away, in the equatorial region of the 
magnetosphere — the area around Earth that 
is dominated by the planet’s magnetic field. For 
decades, it has been suggested that pulsating 
auroras are the result of interactions between 
magnetospheric electrons and electromagnetic 
waves called chorus waves that send electrons 
careering towards Earth’s atmosphere along 
magnetic-field lines’. On page 337, Kasahara 
et al.’ report direct evidence for this process 
using observations both from Earth’s surface 
and from a spacecraft positioned ona field line. 

Because magnetic fields are invisible to the 
human eye, the prediction of where a field line 
hits Earth and where that same field line exists 
out in space — a task known as magnetic- 
field-line mapping — is extremely difficult’. 
Luckily, electrons that move around Earth tend 
to follow these field lines closely. When these 
particles interact with chorus waves, they can 
be directed into a region of the upper atmos- 
phere called the ionosphere, where they often 
generate auroral light. This allows us instantly 
to see the footprint of the associated field lines. 

In addition, if we have an observation 
platform at a precise location out in space, we 
can detect the chorus waves that caused the 
electrons to head towards the atmosphere and 
see fluctuations in the electron population that 
arise from the oscillation of the waves. The 
trick is to get the ground-based and space- 
based observations to line up at the right time 
and place, and to have instruments sensitive 
enough to view both processes simultaneously. 
This feat has eluded observers ever since the 
theory of pulsating-aurora generation was 
developed®”. 

The first challenge is to have an instrument 
capable of making the in situ measurement of 
electrons in space at the required resolution. 
The Arase spacecraft®, launched by the Japan 
Aerospace Exploration Agency in late 2016, 


Pulsating 
aurora 


Magnetospheric equator 


Electron 


Magnetic-field 
line 


Figure 1 | Pulsating-aurora generation. Kasahara et al.’ report evidence for a mechanism that explains 
the occurrence of blinking patches of light in Earth’s atmosphere called pulsating auroras. In a region 
surrounding Earth known as the magnetosphere, electrons are trapped by the planet's magnetic field 

and travel (red arrows) along magnetic-field lines. When these particles interact with electromagnetic 
waves called chorus waves, which are generated in the equatorial region of the magnetosphere, they 

can be directed towards Earth's atmosphere, where they produce auroral light. The authors detected the 
interactions between the chorus waves and the electrons using the Arase spacecraft’, which was positioned 


on the relevant field line. 


carries just such an electron detector, which 
was designed to observe electrons within a 
narrow window around a magnetic-field line. 
The spacecraft is also equipped with instru- 
ments to detect chorus waves. Kasahara and 
colleagues analysed data from the spacecraft 
to uncover fluctuations in the electron popula- 
tion and the associated chorus waves. 

The next obstacle was to determine where 
the field line threading the position of the 
spacecraft would hit the ground. Magnetic-field 
models are now sophisticated enough to be able 
to inform researchers of the approximate loca- 
tion of a field-line footprint in Earth’s atmos- 
phere, generally with higher accuracy when the 
level of geomagnetic activity (magnetic storms) 
is low. In the vicinity of this footprint, Kasahara 
et al. looked for a corresponding pulsating- 
aurora signal — namely, variations in auroral- 
light intensity that matched the fluctuations in 
the electron population. They identified such a 
signal in measurements from an all-sky imager 
based in central Canada’, which essentially 
records black-and-white video of the hemi- 
spherical view of the sky above (see Figure 2 
of the paper’). 

Thanks to Kasahara and colleagues, we can 
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see the complete process of pulsating-aurora 
generation for the first time: the fluctuations in 
an electron population out in space; the chorus 
waves responsible for these fluctuations; and 
the variations in auroral-light intensity from 
the ground (Fig. 1). The last part is somewhat 
analogous to watching an image on an old- 
fashioned television, where the ionosphere is 
the ‘screen’ onto which electrons are projected. 
Despite this simple picture, researchers are 
aware that the ionosphere probably changes 
the incoming signal — a detail that will no 
doubt be scrutinized in future studies. 

Kasahara et al. carried out an analysis in 
which they correlated the electron fluctuations 
and chorus waves in space with the pulsating- 
aurora signals seen by the all-sky imager on the 
ground. This step revealed the precise location 
in the atmosphere in which the field-line foot- 
print resides. Such a technique has incredible 
potential to test and refine our current mag- 
netic-field models by comparing the modelled 
footprint location to the observed location. In 
the future, magnetic-field-line mapping might 
well rely on a similar methodology to gain 
insight into magnetic topology — the structure 
and linkage of field lines. 


BEN ASHTON 


There is one caveat, however: clear skies are 
required to see and measure the pulsating- 
aurora signals, so Earth's terrestrial weather 
needs to cooperate. Furthermore, the chorus 
waves contain components of different fre- 
quency that interact with magnetospheric 
electrons in different ways depending on the 
energy of the particles. This affects which 
particles end up travelling down to Earth’s 
atmosphere. These details are directly related 
to geomagnetic activity and have not yet been 


ANIMAL BEHAVIOUR 


fully quantified. There is still a rich body 
of research to be carried out regarding the 
mysterious pulsating auroras. m 


Allison N. Jaynes is in the Department of 
Physics and Astronomy, University of Iowa, 
Iowa City, Iowa 52242, USA. 

e-mail: allison-n-jaynes@uiowa.edu 
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Brainpower boost for 
birds in large groups 


Whether intelligence is selected for in species that have a complex social life is 
debated and hard to test. Cognitive performance and associated reproductive 
success are now linked to group size in wild magpies. SEE LETTER P.364 


ANDREW WHITEN 


bservations of primates’ everyday 

lives led the psychologist Nicholas 

Humphrey to make a revolutionary 
proposal’ in 1976 to explain primate intelli- 
gence. Before then, it had been commonly 
assumed that these animals’ cleverness was an 
adaptation to their physical niches, reflected 
in their need for sophisticated skills in realms 
such as foraging, navigation or avoiding pred- 
ators. Humphrey suggested instead that the 
complex social dynamics experienced when 
such animals live in a group become the main 
selective force driving the evolution of primate 
intelligence. On page 364, Ashton et al. offer 
support for Humphrey’s social-intelligence 
hypothesis, in a study of wild Australian mag- 
pies (Cracticus tibicen dorsalis, also known as 
Gymnorhina tibicen dorsalis). 

The causal relationships between social 
complexity, intelligence and reproductive 
success proposed by Humphrey inspired a 
generation of primatologists, who uncov- 
ered unexpected sophistication in monkeys’ 
and apes’ social knowledge and political 
manoeuvrings’’. However, these discoveries 
arguably made it difficult to test Humphrey's 
hypothesis directly, because intelligence — 
both in social interactions and in non-social 
realms, such as foraging or tool use — and 
social complexity were revealed to be com- 
posed of many components*”. Primate social 
complexity, like intelligence itself, was found 
to be extraordinarily complex. 

In 1995, primatologist Robin Dunbar 
suggested’ that focusing on the typical group 
size of a species as a proxy for social complex- 
ity, and on its brain size instead of intelligence, 


might resolve the dilemma of how to test 
Humphrey’s ideas. Both measurements were 
available for a range of primates — and, as 
predicted, a positive relationship was found 
between these factors. Multiple teams rep- 
licated the finding, using a range of related 
variables — for example, measuring the rela- 
tive sizes of the neocortex region rather than 
overall brain size — for primates’ and other 
taxa®. Yet, as Ashton and colleagues acknowl- 
edge, analyses of large databases often provide 
conflicting results. When many variables differ 
between species, cross-species comparisons can 
lack robustness, because compensating for the 
differences can make a study so unwieldy that 
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ons 


it undermines reliable testing of a hypothesis’. 

Ashton and colleagues turned instead 
to intraspecies comparisons. They studied 
56 magpies, which were ringed to enable 
identification, from 14 different territorial 
groups that ranged in size from 3 to 12 birds 
(Fig. 1). Rather than measuring brain size, the 
authors conducted cognitive-performance 
tests in which the birds encountered wooden 
or plastic devices that tested problem-solving 
skills; successful birds received a mozzarella- 
cheese treat. Four different devices each tested 
a specific skill, including spatial memory and 
the ability to learn new associations between 
stimuli and rewards. 

The authors report that group size is linked 
to cognitive performance. Birds living in a 
larger group displayed better performance at 
the population level on each of the tests than 
did birds living in smaller groups. At the indi- 
vidual level, exceptions to this trend could 
be found — some birds from smaller groups 
outperformed birds from larger groups, and 
vice versa. 

The authors recorded the reproductive 
success of individuals by counting the average 
number of hatched clutches of eggs per year, 
and found that birds that performed well on 
the tests had greater reproductive success than 


Figure 1 | Australian magpies in Guildford, Western Australia. Wild Australian magpies 

(Cracticus tibicen dorsalis, also known as Gymnorhina tibicen dorsalis) are territorial and live in groups. 
Ashton et al.’ analysed the relationship between the birds’ group size and cognitive performance to test 
the long-debated idea’ that life in complex social groups can select for intelligence. At the population 
level, larger groups of birds performed better than smaller groups in cognitive-performance tests, and 
cognitive performance was also linked to reproductive success. 
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birds that performed poorly. Evidence for a 
connection between test performance and 
biological fitness could not have been assessed 
by the earlier approaches based on interspecies 
comparisons. 

For any given individual, the bird’s 
performance on each of the four skill tests 
was correlated. A principal-components sta- 
tistical analysis, which identifies the number 
of factors accounting for variance in an array 
of scores, showed that a single unknown 
factor accounted for 65% of the variance in 
test scores. The authors refer to this factor as 
‘general intelligence; analogous to the ‘g factor’ 
used to assess general intelligence in humans. 
However, it is worth noting that the magpie 
tests assess learning abilities rather than test- 
ing the capacity to invent creative solutions to 
problems — a talent sometimes considered 
to be a defining characteristic of animal or 
human intelligence. 

Humphrey proposed that intelligence 
evolved in response to the pressures of social 
complexity. Ashton and colleagues, however, 
did not directly address this evolutionary 
hypothesis; instead, they investigated the rela- 
tionship between social-group size and the 
development of cognition in the birds’ early 
life, a linkage that was observed to emerge by 
the time the birds were 200 days old. Neverthe- 
less, the link between cognitive prowess and 
fitness identified by Ashton and colleagues 
has major implications for connecting the 
social-intellect hypothesis to an underlying 
evolutionary mechanism, and it suggests that 
selection is acting on a relationship between 
sociality and cognition. 

Group size is, of course, a crude index of 
social complexity, just as brain size is a crude 
indicator for the complexities of cognition 
being selected for. Indeed, group size itself can- 
not be the key causal factor — the immensity 
of a wildebeest herd, for example, is unlikely 
to select strongly for intellect. And even three 
individuals can suffice to create a high level 
of social complexity, as demonstrated in the 
humorous account provided by Jerome K. 
Jerome's 1889 book Three Men in a Boat, or 
documented in an analysis’ of three adult 
chimpanzees that repeatedly shifted alliances 
in a way that allowed each to be supreme for 
a while in their ‘game of thrones: Ashton and 
colleagues can only speculate on how mag- 
pies’ cognitive powers relate to social-group 
size and number of offspring, and suggest that 
some kind of political skill, such as the ability 
to successfully handle or avoid conflicts, might 
be involved. 

Which types of intelligence deserve closer 
attention in future studies of the social- 
intelligence hypothesis? Some substantially 
different ideas exist regarding which form 
of intelligence might be selected for*. One 
such contrast is between whether social 
complexity selects for overall intelligence 
across many different contexts (also known 


as domain-general intelligence), or for 
specialized forms of intelligence, such as the 
social skill of understanding what others might 
be thinking. This might sound like a tall order 
for a magpie. But there is evidence’’ for such 
sophisticated behaviour in the crow family — 
a western scrub-jay (Aphelocoma californica) 
might make a theft-prevention manoeuvre by 
relocating hidden food elsewhere if it spots 
that another bird observed where the food 
was hidden. Social cognition has numerous 
other manifestations’. Yet, what Ashton 
et al.’ tested was essentially non-social cog- 
nition. What now begs to be fleshed out is 
the nature of both the social and non-social 
intelligence skills that may have been at work 
in the phenomena these authors observed. m 
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Burst firing sets the 
Stage for depression 


Salvos of neuronal activity in the brain’s lateral habenula, regulated by astrocyte 
cells, drive depression-like behaviours in rodents. The finding might help us to 
understand one antidepressant and to develop more. SEE ARTICLES P.317 & P.323 


WILLIAM M. HOWE & PAUL J. KENNY 


pposing forces shape our everyday 

lives — for instance, stimuli can 

encourage us to move or stop, and 
events can make us happy or sad. Accordingly, 
our brains are designed with ‘yin-yang’ systems 
that guide our actions and influence our feel- 
ings. Neurons in the brain’s mesolimbic system 
promote reward-seeking behaviour and help 
to process information about actions that 
result in pleasurable outcomes! ?, By contrast, 
neurons in the lateral habenula (LHb) encode 
information related to noxious outcomes and 
suppress reward-seeking* °. Unbalancing these 
opposing systems might therefore affect our 
behaviour. Indeed, emerging evidence’ sug- 
gests that LHb hyperactivity contributes to 
mood disorders such as major depression. 
Two papers*” in Nature now shed light on the 
mechanisms that underlie LHb hyperactivity, 
and on how the antidepressant drug ketamine 
modulates this state. 

In the first paper, Yang and colleagues® 
(page 317) assessed the firing activity of LHb 
neurons in two rat models of depression. 
Neuronal firing involves depolarization of the 
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electrical potential across the cell membrane 
(in a resting state, the inside of the cell is nega- 
tively charged relative to the extracellular 
space around it). Hyperpolarization, in which 
the cell interior becomes more negative than 
normal, is typically associated with neuronal 
inhibition. 

By studying brain slices ex vivo, Yang and co- 
workers showed that LHb neurons were more 
likely to fire in a pattern of rapid bursts in the 
‘depressed’ rats than in control animals. They 
also observed that, when the LHb neurons 
were hyperpolarized, this increased the likeli- 
hood that these cells would fire in bursts rather 
than steady volleys. The researchers went on to 
show that they could increase depression-like 
behaviours in rats using a genetic manipula- 
tion to drive hyperpolarization, and so burst 
firing, in LHb neurons. 

Next, the group investigated the signals 
that regulate this burst firing. In other 
brain regions”, burst firing is controlled by 
N-methyl-p-aspartate receptors (NMDARs) 
— membrane-spanning channel proteins 
whose activation leads to an influx of positively 
charged calcium ions into neurons, resulting 
in depolarization and neuronal firing. Yang 


and colleagues found that burst firing of LHb 
neurons required the activity of NMDARs and 
of another class of protein, T-type voltage- 
sensitive calcium channels (T-VSCCs). 

Could inhibition of NMDARs prevent 
bursting? Ketamine is an NMDAR inhibitor 
and a promising, rapid-acting antidepressant 
in humans (taking effect in as little as 30 min- 
utes)" that is currently in clinical trials for the 
treatment of major depressive disorder with 
imminent risk of suicide. The mechanisms by 
which ketamine acts have been a puzzle to sci- 
entists. Strikingly, Yang et al. found that local 
infusion of ketamine into the LHb elicited anti- 
depressant-like responses in depression-prone 
rats. These findings suggest that the therapeutic 
actions of ketamine might relate, at least in part, 
to its ability to block burst firing in the LHb. 

In the second paper, Cui et al.” (page 323) 
turned their attention to the mechanisms by 
which LHb neurons become skewed to fir- 
ing in burst mode during depression. The 
authors performed a large-scale analysis of 
differentially expressed proteins in the LHb. 
This revealed that the expression of Kir4.1, a 
component of potassium-ion (K") channels, 
was increased in the LHb of depression-prone 
rats compared with that of controls. 

Kir4.1 is expressed in astrocytes, cells that 
interact with neurons to influence their activ- 
ity state’ (although the functional relevance 
of such interactions is still being defined). 
The researchers showed that overexpression 
of Kir4.1 in LHb astrocytes increased the 
burst firing of local neurons and precipi- 
tated depression-like behaviours in mice. 
Conversely, reducing Kir4.1 expression in 
depression-prone rats reduced the burst firing 
of LHb neurons, and attenuated the animals’ 
depression-like behaviours. 

How do Kir4.1-containing channels in 
astrocytes regulate the activity of neurons? 
Neurons can pump K* from their cytoplasm 
into the extracellular space to cause hyper- 
polarization. Cui et al. provide evidence that 
astrocytic K* channels in the LHb help to clear 
away extracellular K*. This facilitates the abil- 
ity of LHb neurons to enter a hyperpolarized 
state, and hence to fire in bursts (Fig. 1). Future 
studies will be required to understand whether 
LHb astrocytes interact with neurons in other 
ways to influence their patterns of activity. 

Together, these two papers provide crucial 
insights into a depression-associated pattern 
of cell firing in the LHb and its regulation by 
ketamine. However, exactly why burst firing of 
LHb neurons increases depression-like behav- 
iours in rodents remains unclear. One potential 
explanation, put forward by Yang et al., is the 
inhibitory influence that LHb neurons exert 
over reward-associated dopamine neurons in 
the mesolimbic system and mood-associated 
serotonin neurons in the midbrain. Perhaps 
burst firing of LHb neurons alters the activity 
of these downstream neurotransmitter systems 
in a manner that reduces their positive effects 
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Figure 1 | Burst firing in depression. a, The activation of membrane-spanning N-methyl-p-aspartate 
receptors (NMDARs) leads to an influx of calcium ions (Ca”*) that depolarizes neurons (the insides 

of which are negative relative to the outside in resting conditions), and causes firing. Ca”* influx also 
leads to an efflux of potassium ions (K*) through K*-channel proteins. Yang et al.* show that, in the 
brain’s lateral habenula (LHb), coordinated activity of NMDARs and T-type voltage-sensitive calcium 
channels (T-VSCCs) causes neuron firing to occur in a pattern of rapid bursts, and that such firing 

leads to depression-like symptoms in rats through unknown mechanisms. b, Cui et al.’ demonstrate 
that K* ions are rapidly cleared away by nearby cells called astrocytes, which express K* channels 
containing the protein Kir4.1. This facilitates rapid K* efflux from the neuron and its entry into a state of 
hyperpolarization, which increases the likelihood of burst firing’. 


on mood, thereby increasing vulnerability to 
depression. Testing this possibility, and other 
potential mechanisms, will require further 
experimentation. The potential involvement 
of subpopulations of dopamine and serotonin 
neurons that encode aversion-relevant stimuli 
must also be considered”. 

The current studies have several therapeu- 
tic implications. First, Cui and colleagues’ 
data show that astrocytes might have a key 
role in regulating a brain system involved in 
mood and motivation. Previous studies link- 
ing astrocyte function to disease states have 
focused mostly on neurodegenerative and 
developmental disorders'*"’, but the current 
papers suggest that modulating the activity of 
these cells might be a way to treat psychiatric 
disorders. 

Second, Yang et al. showed that a T- VSCC 
blocker delivered directly into the LHb had 
antidepressant-like effects, similar to those of 
ketamine. This raises the exciting possibility 
that T-VSCC blockers, or other compounds 
that suppress LHb burst firing, could be effec- 
tive antidepressants. 

Finally, the papers shed light on the possible 
mechanisms by which ketamine elicits rapid 
antidepressant effects in humans. Ketamine 
and its metabolites stimulate the formation 
of synaptic connections between neurons in 
the brain’*""’, a process that is thought to be 
important for the drug’s therapeutic effects. 
The findings suggest that another property 
of the drug — its ability to inhibit burst firing 
ina brain region implicated in aversion and 
negative mood — also contributes to its effi- 
cacy and explains its rapid onset of action. This 


knowledge might facilitate the development of 
next-generation ketamine-related antidepres- 
sants that specifically target LHb activity and 
that might eliminate two major side effects 
of ketamine and other NMDAR blockers: 
their abuse potential and the induction of a 
transient, schizophrenia-like, psychotic state. m 
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Working memory 
freed from the past 


Working memory is influenced by past experiences. An area of the rat brain 
has now been identified that represents recent history — silencing this area can 
remove biases from working memory and decision-making. SEE LETTER P.368 


LAURA BUSSE 


ur natural environment is character- 

ized by statistical regularities in both 

space and time — for example, it does 
not consist of randomly distributed light, but 
frequently contains objects that have elongated 
vertical and horizontal contours’ and predict- 
able motion’. These regularities are exploited 
by our brains to interpret the often-ambigu- 
ous information that reaches our senses, to 
improve perception, decision-making and 
working memory. Although, in the natural 
world, past experiences can thus have a vital 
role in information processing and cognition, 
past experiences can impair performance in 
laboratory experiments that involve random 
structure. On page 368, Akrami et al.’ take 
advantage of this difference to identify an area 
of the rat brain that holds recent sensory his- 
tory, and show that temporarily silencing this 
area removes history-related biases in mem- 
ory-guided decision-making. 

More than a century ago, the cognitive 
psychologist Harry Hollingworth identified a 
phenomenon called contraction bias‘, in which 
the representation of a stimulus in working 
memory is systematically biased towards the 
average of recent past observations. As an 
example of contraction bias, consider Holling- 
worth’s historic experiment, in which partici- 
pants were asked to judge the sizes of different 
cards. Subjects tended to overestimate the size 
of memorized cards if they were smaller than 
others used in the experiment, and to under- 
estimate the size of memorized cards that were 
relatively large. 

Akrami et al. first set out to show that, like 
these findings in humans, contraction bias 
governs the behaviour of rats. They trained 
rats in an auditory working-memory task, in 
which the animals had to determine which 
of two tones, delivered several seconds apart, 
was louder. This task requires rats to hold 
the loudness of the first tone in working 
memory during the delay period between 
the two tones, to enable comparison with the 
second (Fig. 1). 

The authors used a custom-built, high- 
throughput facility for automated rat training 
and testing, allowing them to collect data from 
almost halfa million trials. Analysis of the rats’ 


behaviour revealed that, in trials in which the 
first tone was the fainter, performance was 
improved if the previous trial had contained 
a pair of relatively faint tones, and hampered if 
it had contained louder tones. The group fitted 
powerful computational models to the rats’ 
behaviour that captured the animals’ recent 
sensory history. The models demonstrated 
that these systematic patterns of performance 
are predicted by contraction bias, which pulls 
the representation of the first tone (held 
in working memory) towards the recently 
experienced tones. In doing so, contraction 
bias facilitates or impedes the comparison of 
the two tones. 

In a search for the site of working memory 


in the rat brain, Akrami et al. next made a truly 
exceptional observation. They temporarily 
silenced a brain region called the posterior 
parietal cortex (PPC), which has been impli- 
cated in working memory’, and found that 
overall task performance was improved by 
this intervention, particularly when the PPC 
was silenced during the delay period. This 
observed cognitive enhancement is clearly 
inconsistent with the idea that the PPC main- 
tains the working-memory trace during the 
delay period, because silencing the PPC would 
erase the trace and hence lead to performance 
breakdown. 

What underlies this remarkable improve- 
ment in performance during PPC silencing? 
Akrami and colleagues fitted their computa- 
tional model to the rats’ behaviour during PPC 
silencing, and found that the improvement was 
due to a markedly reduced influence of past 
recent experience. Previous sensory informa- 
tion can distort working-memory representa- 
tions. Therefore, in the randomized world of 
laboratory experiments, it makes sense that 
silencing a brain area that holds traces of sen- 
sory history can lead to more bias-free work- 
ing-memory content and thus to behaviours 
more aligned with reality. 

To directly test how PPC neurons encode 
the recent past, Akrami et al. recorded PPC 
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Figure 1 | Biasing the working memory of rats. a, Akrami et al.’ trained rats in a working-memory 
task, in which the animals listened to two tones separated by a delay of a few seconds, and had to indicate, 
through a nose poke, which was the louder. The task requires the first tone to be held in working memory 
during the delay period. b, The authors investigated how recent sensory history affected trials (here, 
trials in which the first tone was the fainter). If the animal had recently been exposed to faint tones, it 
remembered the first sound as fainter than it really was, improving its ability to compare the two tones. 

c, By contrast, the animal’s judgement was hampered if it had recently been exposed to loud tones, 
because it remembered the first tone as louder than it really was. Thus, the rat's recent sensory history 
impinged on how it held the first tone in working memory. 
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activity during task performance. Consistent 
with their other results, the group found that, 
during the delay period, PPC neurons carried 
more information about the recent past than 
about the current tone held in working mem- 
ory. Furthermore, across rats, the percent- 
age of neurons holding such history-related 
information was tightly correlated with the 
level of history-induced bias in each animal's 
performance. 

It has long been recognized that even simple 
perceptual decisions are not isolated, static 
computations — rather, they are dynamic 
processes embedded in a stream of past infor- 
mation®. Akrami and colleagues have taken 
a crucial step towards understanding these 
processes, by identifying a node in the brain 
network that influences working-memory 
performance by holding a representation 
of recent history. That their discovery was 
made in an animal model offers the exciting 
possibility that future studies could investigate, 
at the cellular level, precisely which neurons 
in the PPC represent recent history, how this 
representation arises, and where and how itis 
integrated with sensory information to guide 
memory-based decision-making. 

Statistical regularities not only influence 
working memory, but can also bias motor 
outputs and the encoding and decoding of 
sensory information’. It is therefore likely that 
history-related biases are implemented across 
many nodes of the decision-making brain net- 
work in addition to the PPC, through various 
mechanisms. Long-term mechanisms, on evo- 
lutionary and developmental scales, include 
the adaptation of neurons in sensory areas to 
the statistics of the natural environment. For 
example, the over-representation of vertical 
and horizontal contours in our visual environ- 
ments is paralleled by an over-representation 
in the brain of neurons that respond preferen- 
tially to horizontal and vertical orientations®, 
and a biased ability to perceive these orien- 
tations’. Dynamic, short-term mechanisms 
occurring on a scale of milliseconds to sec- 
onds probably involve activity in long-range 
feedback or neuromodulatory circuits that 
adjust sensory areas””, the decision-making 
network"! and motor areas". Elucidating 
the interplay between these mechanisms is 
just one of the remaining challenges in the 
quest to understand the powerful role of past 
experience in working memory. m 
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The two faces of a 
magnetic honeycomb 


Quantum spin liquids are long-sought exotic states of matter that could transform 
quantum computing. Signatures of such a state have now been observed ina 
compound comprising iridium ions on a honeycomb lattice. SEE LETTER P.341 


MARTIN MOURIGAL 


cientists are searching for elusive forms 

of magnetism in which spins — atomic- 

scale ‘compass needles’ associated with 
electrons — perpetually dance to an intrinsic 
quantum beat. On page 341, Kitagawa et al.’ 
describe the synthesis and properties of a 
remarkable quantum magnet in which the 
ballet of spins persists down to a temperature 
of 0.05 kelvin. Such behaviour might be associ- 
ated with exotic magnetic excitations that are 
of great fundamental interest and are sought 
for quantum-computing technologies. 

Quantum magnets are often found in elec- 
trically insulating crystals. In such materials, 
unpaired electrons are arranged on a periodic 
lattice, which allows the spins of the electrons 
to interact with those of their neighbours. At 
low temperatures, these spins usually organize 
into symmetrical and regular patterns. 

In rare cases, however, quantum fluctuations 
prevent the spins from becoming ordered. 
Instead, the spins enter a quantum super- 
position — a concerted and perpetual dance 
in which the spins are entangled, meaning 
that they are inseparable and share a common 
quantum state. Many flavours of such states, 
called quantum spin liquids, have been pre- 
dicted’. At first sight, these states resemble 
paramagnetic materials, in which spins are 
disordered in the absence of an external mag- 
netic field. However, whereas spins behave as 
independent entities in paramagnets, those in 
quantum spin liquids are entangled with one 
another, even if separated by long distances. 

Experimental physicists have long pondered 
how to obtain and detect quantum spin liquids 
in real materials’. Much has been learnt from 
the study of one-dimensional quantum mag- 
nets — in particular, from chains of spins that 
exhibit antiferromagnetism’*, whereby each 
spin is aligned in the opposite direction to that 


of its neighbours. For instance, a by-product 
of long-range entanglement is the presence of 
magnetic excitations that have fractional quan- 
tum numbers (fractions of quantities such as 
electric charge and spin). These excitations have 
been shown to leave distinct fingerprints in 
measurements of a material's thermodynamic 
and magnetic properties. Furthermore, the 
presence or absence of an excitation gap (a lack 
of excitations that have particular energies) 
often reveals whether the underlying entangle- 
ment is short- or long-range, respectively. 

However, in spite of these breakthroughs, 
finding two- and three-dimensional quantum 
spin liquids has been a daunting task’. One 
approach has been to use geometric frustra- 
tion’, in which there is an incompatibility 
between the spatial arrangement of spins and 
their interactions. This causes many spin con- 
figurations to have the same energy, which 
jump-starts entanglement. Kitagawa and 
colleagues used a different materials-science 
strategy, and focused on a quantum spin liquid 
that was proposed by the theoretical physicist 
Alexei Kitaev’ in 2006. 

In Kitaev’s model, spins on a honeycomb 
lattice are forced to interact in seemingly 
unnatural ways. The resulting quantum spin 
liquid has two types of exotic magnetic excita- 
tion: Majorana fermions, which have fractional 
quantum numbers, lack an excitation gap and 
can propagate on the lattice; and other exci- 
tations that have a small excitation gap and 
remain localized. 

In a seminal paper’, it was demonstrated 
that the ingredients of Kitaev’s model might 
exist in real materials, accompanied by 
more-conventional spin interactions called 
Heisenberg interactions. It was later suggested® 
that the model could be realized in materi- 
als that have two key properties. The first is a 
strong coupling between the motion of elec- 
trons and their spin — a feature present in the 
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Figure 1 | The structure and spin interactions of H,Lilr,O,. a, Kitagawa et al.' report that, unlike other 
magnetic materials, the compound H,Lilr,O, does not show magnetic ordering at temperatures close to 
absolute zero (H, hydrogen; Li, lithium; Ly iridium; O, oxygen). A single unit of a layer of the compound is 
shown here. Octahedra (purple) formed from iridium and oxygen ions cause the iridium ions to be arranged 
ona honeycomb lattice. b, The magnetic properties of H,Lilr,O, suggest that at least two types of interaction 
are at play between the spins (magnetic moments) of neighbouring iridium ions. The first are Heisenberg 
interactions (black lines), which cause spins to align in the opposite direction to that of their neighbours. 
The second are Kitaev interactions (coloured lines), which predispose neighbouring spins to co-align in 

one of three possible orientations, depending on which bond of the lattice is considered. Each line colour 
corresponds to a different preferred orientation. Because each lattice site participates in three distinct bonds, 
a given spin receives contradictory requests from its neighbours, which prevents magnetic ordering. 


outer electron shells of some transition-metal 
ions. The second is a specific bonding geometry, 
which is achieved when octahedra formed 
from metal and oxygen ions share their edges 
to form a honeycomb lattice. Compounds such 
as lithium iridate” (Li,IrO,), sodium iridate 
(Na,IrO;) and ruthenium chloride’? (RuCl,) 
fulfil these requirements. However, all of these 
compounds become magnetically ordered 
below a few kelvin, which excludes them as 
genuine realizations of Kitaev’s model. 

Using an approach known as soft chem- 
istry, in which materials are modified under 
mild temperature conditions (here, at 120°C), 
Kitagawa et al. replaced inter-layer lithium 
ions of Li,IrO, with hydrogen ions, while 
preserving the overall structure of the mag- 
netic-honeycomb layers. The authors found 
that the resulting compound, H;Lilr,O,, has 
low-temperature properties that are spec- 
tacularly different from those of Li,IrO,. In 
particular, they observed no magnetic order- 
ing in H,Lilr,O, down to a temperature of 
0.05 K, and used nuclear magnetic resonance 
to confirm these findings down to 1 K (see 
Figure 2 of the paper’). The complex inter- 
play between Kitaev and Heisenberg inter- 
actions” is probably key to stabilizing this 
quantum state (Fig. 1). The authors discov- 
ered that the compound’s thermodynamic and 
magnetic properties are highly unusual, and 
interpreted these measurements in terms of 
exotic magnetic excitations that lack an excita- 
tion gap and perhaps have fractional quantum 
numbers. 

H,Lilr,O, is a remarkable compound: it is 
the first iridium-based honeycomb magnet 
that does not become magnetically ordered 
at temperatures below a few kelvin. However, 
its overall behaviour and unusual thermo- 
dynamic properties indicate that it is governed 
by microscopic ingredients that differ sub- 
stantially from those of Kitaev’s model. This 


is not a curse, but a blessing: understanding 
exotic magnetic phases is often achieved by 
studying related materials and models”. Future 
experimental work in which, for example, 
large crystals of H;Lilr,O, are grown, or parti- 
cles such as neutrons or photons are scattered 
off H,Lilr,O,, could reveal whether the com- 
pound’s excitations have fractional quantum 
numbers — the ultimate experimental proof 
of a quantum spin liquid. 


MEDICAL RESEARCH 


The soft-chemistry approach used by 
Kitagawa et al. offers great promise for control- 
ling the properties of layered quantum mag- 
nets. However, with regard to oxide materials, 
there are challenges associated with the pres- 
ence of chemical disorder and heterogeneities. 
For instance, the layered structure of H,Lilr,O, 
is prone to faults associated with the stacking 
of the layers”. In the future, it will be excit- 
ing to see chemists and physicists join forces 
to develop a deeper understanding of how 
material defects influence, and potentially even 
favour, entangled magnetic matter. m 
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On the trail of invasive 
cells in breast cancer 


During breast-cancer progression, tumour cells that arise in the milk duct 
spread elsewhere in the breast. The origin of these invasive tumour cells is now 
revealed by an analysis of spatially defined single cells. 


DOUGLAS S. MICALIZZI 
& SHYAMALA MAHESWARAN 


ductal carcinoma in situ (DCIS), a cluster 

of cancer cells arises in the milk duct and 
remains confined there. The recorded inci- 
dence of DCIS has risen since the late 1980s, 
probably due to an increase in its detection 
through more widespread breast-cancer 
screening’. Up to 40% of cases of DCIS pro- 
gress to invasive ductal carcinoma (IDC), 
in which tumour cells invade other regions 
of the breast” *. IDC requires clinical treat- 
ment. Moreover, because it is not possible to 
predict which people with DCIS are at risk of 
progressing to IDC, the standard treatment 


I: the early-stage breast cancer called 


308 | NATURE | VOL 554 |15 FEBRUARY 2018 
© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


for patients who have DCIS is surgery, often 
followed by radiotherapy’. Therefore, under- 
standing how DCIS progresses to IDC might 
enable the selective treatment of those at high 
risk of developing IDC. A paper in Cell by 
Casasent et al.° reveals the origin of DCIS and 
its progression to IDC. 

Casasent and colleagues obtained frozen 
samples of breast-cancer tissue gathered 
from patients. The authors used single-cell 
DNA sequencing of cells from DCIS and IDC 
regions in the same section of tissue to create 
molecular maps of patients’ breast cancers, in 
an approach they call topographic single-cell 
sequencing. They stained the samples to iden- 
tify tumour cells and noted the location of each 
selected cell in the regions of DCIS and IDC. A 
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ductal carcinoma in situ (DCIS), a cluster 

of cancer cells arises in the milk duct and 
remains confined there. The recorded inci- 
dence of DCIS has risen since the late 1980s, 
probably due to an increase in its detection 
through more widespread breast-cancer 
screening’. Up to 40% of cases of DCIS pro- 
gress to invasive ductal carcinoma (IDC), 
in which tumour cells invade other regions 
of the breast” *. IDC requires clinical treat- 
ment. Moreover, because it is not possible to 
predict which people with DCIS are at risk of 
progressing to IDC, the standard treatment 
for patients who have DCIS is surgery, often 
followed by radiotherapy’. Therefore, under- 
standing how DCIS progresses to IDC might 
enable the selective treatment of those at high 
risk of developing IDC. A paper in Cell by 
Casasent et al.’ reveals the origin of DCIS and 
its progression to IDC. 

Casasent and colleagues obtained frozen 
samples of breast-cancer tissue gathered 
from patients. The authors used single-cell 
DNA sequencing of cells from DCIS and IDC 
regions in the same section of tissue to create 
molecular maps of patients’ breast cancers, in 
an approach they call topographic single-cell 
sequencing. They stained the samples to iden- 
tify tumour cells and noted the location of each 
selected cell in the regions of DCIS and IDC. A 
technique called laser capture microdissection 
enabled the separation of a selected cell from 
its neighbours by tracing a laser beam around 
the cell. This was combined with an approach 
termed laser catapulting, in which energy from 
an ultraviolet laser propels a microdissected cell 
into a collection tube for subsequent DNA anal- 
ysis. These laser-based techniques are used in 
other contexts, including forensic science and 
studies of plant physiology’. 

The authors analysed 1,293 of these 
individually isolated tumour cells from 
10 patients with breast cancer. They assessed 
whether the cells isolated from each patient 


I: the early-stage breast cancer called 


contained alterations in the number of cop- 
ies of genes, a type of change that sometimes 
drives tumour formation and growth. A 
comparison of these alterations in tumour 
cells from an individual can identify distinct 
populations of cells (clones). Analysis of the 
patterns of genetic alterations can reveal a 
clone’s cellular origin and its relationship with 
other clones. Combining this information with 
knowledge of a cell’s location offers a way to 
track the progression of DCIS to IDC. 
Casasent et al. observed that the tumours of 
four patients had formed froma single clone, 
and that those of the other six patients con- 
sisted of between 2 and 5 clones. The clones of 
each patient with multiclonal tumours shared 
a set of genomic aberrations, which suggests 
that all tumour cells in each of those patients 
had descended from a single cell, with genetic 
divergence between the various clones arising 


Milk duct 


Transient 
genomic 
instability 


=> 


Single 
tumour-initiating cell 


from the acquisition of further alterations. 

In patients with multiclonal tumours, all of 
the individual’s clones were present in both 
DCIS and IDC regions (Fig. 1). This points 
to a model in which clones arise and evolve 
in the milk duct and then escape to establish 
IDC elsewhere in the breast, such as in regions 
containing stromal cells. The various clones 
had a wide distribution across breast tissue, 
with certain clones being evenly distributed 
between milk ducts and regions of tumour 
invasion, and others being enriched in either 
the invaded regions or the milk ducts. 

The authors conducted additional analysis 
by sequencing the protein-coding regions of 
the tumour cells’ DNA — detecting an aver- 
age of 23 mutations per patient. This analysis 
confirmed that there were striking similari- 
ties between specific clones in a given patient, 
regardless of whether the clonal tumour 
cell was part of a DCIS or an IDC region. 
The transition from DCIS to IDC was not 
associated with a notable increase in the 
number of mutations in the clones or in the 
number of independent clones. A few mutations 
specific to DCIS or IDC regions were identified, 
and, although these warrant further study, the 
authors’ main finding is the remarkable genetic 
similarity of a patient’s tumour cells in these 
two distinct disease states. The observation that 
genomic evolution of tumour cells occurs in the 
milk duct before invasion proceeds suggests that 
a transient period of genome instability is an 
early event in breast-cancer initiation. 
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Figure 1 | Tracking the cellular origins of breast cancer. Casasent et al.° report the development of 

a technique for spatially resolved analysis of individual breast-cancer cells in tissue samples obtained 
from patients. This enabled them to determine the cellular origin of a non-invasive breast cancer called 
ductal carcinoma in situ (DCIS), in which tumour cells form in the milk duct. The authors reveal that 
DCIS arises from a single tumour-initiating cell and its descendants. Some of these tumour cells might 
diverge from each other during a period of transient genomic instability to form several different cellular 
populations, or clones (distinct clones shown as red, yellow or dark purple). DCIS sometimes progresses 
to an invasive stage in which tumour cells migrate to other areas of the breast tissue, such as regions 
containing stromal cells, where tumour growths known as invasive ductal carcinoma (IDC) form. The 
identity of the cells that give rise to IDC has been debated. Casasent and colleagues found that all the 
tumour clones present in regions of DCIS are also present in areas of IDC. 
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Casasent and colleagues’ model, in which 
breast cancer arises in the milk duct from a sin- 
gle initiating cell that gives rise to several clones 
in the duct that then invade the rest of the breast 
tissue, contrasts with two alternative models of 
IDC development*”*. The independent-lineage 
model*” proposes that DCIS and IDC evolve 
independently from different tumour-initiating 
cells. The other model”, called an evolutionary 
bottleneck, proposes that a single initiating cell 
in the milk duct gives rise to several clones, only 
one of which successfully escapes the milk duct 
to invade the rest of the tissue. 

Many questions remain. What triggers the 
genetic alterations in the tumour-initiating cell 
in DCIS? Do other potential tumour-initiating 
cells arise at the earliest stage of the disease, 
of which only one gives rise to DCIS and the 
others are eliminated? A better understanding 
is needed of how tumour cells escape the milk 
duct through the breakdown of the duct’s outer 
layer (the basement membrane), and how 
some cells from all the clones present in DCIS 
manage to exit the duct. 
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The genetic similarity between tumour 
cells in the DCIS and IDC regions of a given 
individual raises the question of what differ- 
ences enable some cells to give rise to invasive 
disease. A clearer definition of the differences 
between low-risk DCIS that does not advance 
to IDC and high-risk DCIS that does, might 
enable the accurate prediction of disease-pro- 
gression risk. Addressing this will probably 
require single-cell comparisons of gene-tran- 
scription profiles or the assessment of modi- 
fications known as epigenetic changes, which 
alter the protein-DNA complex of chromatin 
and can affect gene expression. 

Casasent and colleagues’ approach could 
be used to study different cancers in which 
spatially resolved insight into tumour growth 
cannot be obtained by conventional methods. 
For example, using this technique to investi- 
gate abnormal cellular growths that eventu- 
ally progress to a malignant state would bea 
good choice for future studies into the origin 
and clonal heterogeneity of tumour cells. It 
is unclear how widely this method is likely 


to be applied, considering the technical and 
financial requirements. Nevertheless, Casasent 
et al. have demonstrated that their approach 
has the potential to provide useful insight into 
the dynamics of tumour clones during the 
progression of breast cancer. m 
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The genus Citrus, comprising some of the most widely cultivated fruit crops worldwide, includes an uncertain number 
of species. Here we describe ten natural citrus species, using genomic, phylogenetic and biogeographic analyses of 60 
accessions representing diverse citrus germ plasms, and propose that citrus diversified during the late Miocene epoch 
through a rapid southeast Asian radiation that correlates with a marked weakening of the monsoons. A second radiation 
enabled by migration across the Wallace line gave rise to the Australian limes in the early Pliocene epoch. Further 
identification and analyses of hybrids and admixed genomes provides insights into the genealogy of major commercial 
cultivars of citrus. Among mandarins and sweet orange, we find an extensive network of relatedness that illuminates the 
domestication of these groups. Widespread pummelo admixture among these mandarins and its correlation with fruit 
size and acidity suggests a plausible role of pummelo introgression in the selection of palatable mandarins. This work 
provides a new evolutionary framework for the genus Citrus. 


The genus Citrus and related genera (Fortunella, Poncirus, 
Eremocitrus and Microcitrus) belong to the angiosperm subfamily 
Aurantioideae of the Rutaceae family, which is widely distributed 
across the monsoon region from west Pakistan to north-central China 
and south through the East Indian Archipelago to New Guinea and 
the Bismarck Archipelago, northeastern Australia, New Caledonia, 
Melanesia and the western Polynesian islands’. Native habitats of 
citrus and related genera roughly extend throughout this broad area 
(Extended Data Fig. la and Supplementary Table 1), although the 
geographical origin, timing and dispersal of citrus species across 
southeast Asia remain unclear. A major obstacle to resolving these 
uncertainties is our poor understanding of the genealogy of complex 
admixture in cultivated citrus, as has recently been shown”. Some 
citrus are clonally propagated apomictically* through nucellar embry- 
ony, that is, the development of non-sexual embryos originating in 
the maternal nucellar tissue of the ovule, and this natural process 
may have been co-opted during domestication; grafting is a rela- 
tively recent phenomenon’. Both modes of clonal propagation have 
led to the domestication of fixed (desirable) genotypes, including 
interspecific hybrids, such as oranges, limes, lemons, grapefruits and 
other types. 

Under this scenario, it is not surprising that the current chaotic citrus 
taxonomy—based on long-standing, conflicting proposals*°—requires 
asolid reformulation consistent with a full understanding of the hybrid 
and/or admixture nature of cultivated citrus species. Here we analyse 
genome sequences of diverse citrus to characterize the diversity and 
evolution of citrus at the species level and identify citrus admixtures 
and interspecific hybrids. We further examine the network of related- 
ness among mandarins and sweet orange, as well as the pattern of the 
introgression of pummelos among mandarins for clues to the early 
stages of citrus domestication. 


Diversity and evolution of the genus Citrus 

To investigate the genetic diversity and evolutionary history of citrus, 
we analysed the genomes of 58 citrus accessions and two outgroup 
genera (Poncirus and Severinia) that were sequenced to high cover- 
age, including recently published sequences”*” as well as 30 new 
genome sequences described here. For our purpose, we do not include 
accessions related by somatic mutations. These sequences represent 
a diverse sampling of citrus species, their admixtures and hybrids 
(Supplementary Tables 2, 3 and Supplementary Notes 1, 2). Our col- 
lection includes accessions from eight previously unsequenced and/or 
unexamined citrus species, such as pure mandarins (Citrus reticulata), 
citron (Citrus medica), Citrus micrantha (a wild species from within 
the subgenus Papeda), Nagami kumquat (Fortunella margarita, also 
known as Citrus japonica var. margarita), and Citrus ichangensis (also 
known as Citrus cavaleriei; this species is also considered a Papeda), as 
well as three Australian citrus species (Supplementary Notes 3, 4). For 
each species, we have sequenced one or more pure accessions without 
interspecific admixture. 

Local segmental ancestry of each accession can be delineated for 
both admixed and hybrid genotypes, based on genome-wide ancestry- 
informative single-nucleotide polymorphisms (Supplementary Note 5). 
Comparative genome analysis further identified shared haplotypes 
among the accessions (Supplementary Notes 6, 7). In particular, we 
demonstrate the F1 interspecific hybrid nature of Rangpur lime and red 
rough lemon (two different mandarin-citron hybrids), Mexican lime 
(a micrantha-citron hybrid) and calamondin (a kumquat-mandarin 
hybrid), and confirm, using whole-genome sequence data, the origins 
of grapefruit (a pummelo-sweet orange hybrid), lemon (a sour orange- 
citron hybrid) and eremorange (a sweet orange and Eremocitrus glauca 
(also known as Citrus glauca) hybrid). We also verified the parentage 
of Cocktail grapefruit, with low-acid pummelo as the seed parent and 
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King and Dancy mandarins as the two grandparents on the paternal 
side. The origin of the Ambersweet orange is similarly confirmed to be 
a mandarin-sweet orange hybrid with Clementine as a grandparent. 
We have previously shown that sour orange (cv. Seville) (Citrus 
aurantium) is a pummelo—mandarin hybrid, and have analysed the 
more complex origin of sweet orange (Citrus sinensis)”. Re-analysing 
sequences from ten cultivars of sweet orange® shows that they are all 
derived from the same genome by somatic mutations, and were thus 
not included in our study. 

We identified ten progenitor citrus species (Supplementary Note 4.1) 
by combining diversity analysis (Extended Data Table 1), multi- 
dimensional scaling and chloroplast genome phylogeny (Extended Data 
Fig. 1b). The first two principal coordinates in the multidimensional scal- 
ing (Fig. 1a) separate three ancestral (sometimes called ‘fundamental’) 
Citrus species associated with commercially important types*’— 
citrons (C. medica), mandarins (C. reticulata) and pummelos (Citrus 
maxima)—and display lemons, limes, oranges and grapefruits as 
hybrids involving these three species. The nucleotide diversity distri- 
butions (Fig. 1b) show distinct scales for interspecific divergence and 
intraspecific variation, and reflect the genetic origin of each accession. 
Hybrid accessions (sour orange, calamondin, lemon and non-Australian 
limes) with ancestry from two or more citrus species are readily iden- 
tified on the basis of their higher segmental heterozygosity (1.5-2.4%) 
relative to intraspecific diversity (0.1-0.6%). Other citrus accessions 
show bimodal distributions in heterozygosity (sweet orange, grape- 
fruits and some highly heterozygous mandarins) due to interspecific 
admixture, a process that generally involves complex backcrosses. 
Among the pure genotypes without interspecific admixture, citrons 
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show significantly lower intraspecific diversity (around 0.1%) than 
the other species (0.3-0.6%). The reduced heterozygosity of citrons, 
a mono-embryonic species, is probably due to the cleistogamy of its 
flowers!°, a mechanism that promotes pollination and self-fertilization 
in unopened flower buds, which in turn reduces heterozygosity. 

The identification of a set of pure citrus species provides new insights 
into the phylogeny of citrus, their origins, evolution and dispersal. 
Citrus phylogeny is controversial’>*'!!”, in part owing to the diffi- 
culty of identifying pure or wild progenitor species, because of sub- 
stantial interspecific hybridization that has resulted in several clonally 
propagated and cultivated accessions. Some authors assign separate 
binomial species designations to clonally propagated genotypes)®. Our 
nuclear genome-based phylogeny, which is derived from 362,748 single- 
nucleotide polymorphisms in non-genic and non-pericentromeric 
genomic regions, reveals that citrus species are a monophyletic group 
and establishes well-defined relationships among its lineages (Fig. 1c 
and Supplementary Note 8). Notably, the nuclear genome-derived 
phylogeny differs in detail from the chloroplast-derived phylogeny 
(Extended Data Fig. 1). This is not unexpected, as chloroplast DNA is 
a single, non-recombining unit and is unlikely to show perfect lineage 
sorting during rapid radiation (Supplementary Note 8.3). 

The origin of citrus has generally been considered to be in southeast 
Asia!, a biodiversity hotspot’ with a climate that has been influenced 
by both east and south Asian monsoons" (Supplementary Note 9). 
Specific regions include the Yunnan province of southwest China’, 
Myanmar and northeastern India in the Himalayan foothills!. A fossil 
specimen from the late Miocene epoch of Lincang in Yunnan, Citrus 
linczangensis'®, has traits that are characteristic of current major citrus 
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Figure 2 | Admixture proportion and citrus 
genealogy. a, Allelic proportion of five progenitor 
citrus species in 50 accessions. CI, C. medica; FO, 
Fortunella; MA, C. reticulata; MC, C. micrantha; 
PU, C. maxima; UNK, unknown. The pummelos 
and citrons represent pure citrus species, whereas 
in the heterogeneous set of mandarins, the degree 
of pummelo introgression subdivides the group 
into pure (type-1) and admixed (type-2 and -3) 
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groups, and provides definite evidence for the existence of acommon 
Citrus ancestor within the Yunnan province approximately 8 million 
years ago (Ma). 

Our analysis establishes a relatively rapid Asian radiation of citrus 
species in the late Miocene (6-8 Ma; Fig. Ic, d), a period coincident 
with an extensive weakening of monsoons and a pronounced climate 
transition from wet to drier conditions!”. In southeast Asia, this marked 
climate alteration caused major changes in biota, including the migra- 
tion of mammals" and rapid radiation of various plant lineages!®”°. 
Australian citrus species form a distinct clade that was proposed to be 
nested with citrons”, although distinct generic names (Eremocitrus and 
Microcitrus) were assigned in botanical classifications by Swingle!* 
Both molecular dating analysis”! and our whole-genome phylogenetic 
analysis do not support an Australian origin for citrus”. Rather, citrus 
species spread from southeast Asia to Australasia, probably via trans- 
oceanic dispersals. Our genomic analysis indicates that the Australian 
radiation occurred during the early Pliocene epoch, around 4 Ma. This 
is contemporaneous with other west-to-east angiosperm migrations 
from southeast Asia?*4, presumably taking advantage of the eleva- 
tion of Malesia and Wallacea in the late Miocene and Pliocene”*”° 
(Supplementary Note 9). 

The nuclear and chloroplast genome phylogenies indicate that 
there are three Australian species in our collection. One of the two 
Australian finger limes shows clear signs of admixture with round 
limes (Supplementary Note 5.4). The closest relative to Australian cit- 
rus is Fortunella, a species that has been reported to grow in the wild 
in southern China?’. Australian citrus species are diverse, and found 
natively in both dry and rainforest environments in northeast Australia, 
depending on the species”®. Our phylogeny shows that the progenitor 
citrus probably migrated across the Wallace line, a natural barrier for 
species dispersal from southeast Asia to Australasia, and later adapted 
to these diverse climates. 

The results also show that the Tachibana mandarin, naturally found 
in Taiwan, the Ryukyu archipelago and Japan”, split from mainland 
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crosses between two parental genotypes, whereas 
red lines represent more complex processes 
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or backcrosses. Whereas type-1 mandarins 

(e) are pure species, type-2 (early-admixture) 
mandarins contain a small amount of pummelo 
admixture that can be traced back to a common 
pummelo ancestor (with P1 or P2 haplotypes). 
Later, additional pummelo introgressions into 
type-2 mandarins gave rise to both type-3 (late- 
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Asian mandarins (Fig. 1c, d) during the early Pleistocene (around 
2 Ma), a geological epoch with strong glacial maxima*”. Tachibana, as 
did other flora and fauna in the region, very probably arrived in these 
islands from the adjacent mainland*! during the drop in the sea level 
of the South China Sea and the emergence of land bridges*”*?, a pro- 
cess promoted by the expansion of ice sheets that repetitively occurred 
during glacial maxima (Supplementary Note 9). 

Although Tachibana®® has been assigned its own species (Citrus 
tachibana), sequence analysis reveals that it has a close affinity to 
C. reticulata***> and does not support its taxonomic position as a 
separate species (Supplementary Note 4.1). However, both chloro- 
plast genome phylogeny (Extended Data Fig. 1b) and nuclear genome 
clustering (Fig. 1a) clearly distinguish Tachibana from the mainland 
Asian mandarins. This suggests that Tachibana should be designated a 
subspecies of C. reticulata. By contrast, the wild Mangshan ‘mandarin’ 
(Citrus mangshanensis)’ represents a distinct species, with compara- 
ble distances to C. reticulata, pummelo and citron” (Extended Data 
Table 1). 


Pattern of pummelo admixture in the mandarins 

Using 588,583 ancestry-informative single-nucleotide polymorphisms 
derived from three species, C. medica, C. maxima and C. reticulata, 
we delineate the segmental ancestry of 46 citrus accessions (Extended 
Data Fig. 2 and Supplementary Note 5). Pummelo admixture is found 
in all but 5 of the 28 sequenced mandarins, and the amount and pattern 
of pummelo admixture, as identified by phased pummelo haplotypes 
(Fig. 2a and Supplementary Note 6), suggests the classification of the 
mandarins into three types. 

Type-1 mandarins represent pure C. reticulata with no evidence of 
interspecific admixture and include Tachibana, three unnamed Chinese 
mandarins (M01, M02, M04) and the ancient Chinese cultivar Sun 
Chu Sha Kat reported here, a small tart mandarin commonly grown 
in China and Japan, and also found in Assam. This cultivar is likely 
described in Han Yen-Chih’s ap 1178 monograph ‘Chi Lu’”®, which 
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Figure 3 | Citrus relatedness network and 
haplotype sharing with sweet orange. 

a, Genetic relatedness among 48 citrus 
accessions derived from four progenitor 
species including citrons, pummelos, pure 
mandarins and micrantha. Solid lines connect 
pairs with coefficient of relatedness r > 0.45, 
with parent-child pairs denoted by arrows 
pointing from parent to child. Dashed and 
dotted lines correspond to 0.35 <r <0.45 

and 0.25 <r <0.35, respectively. Mandarins 
are distinguished from other taxonomic 
groups by an extensive relatedness network, 
indicating shared haplotypes in the ancestral 
gene pool. Three-letter code as in Fig. 1, see 
Supplementary Table 2 for details. b, Shown in 
decreasing order are the values of coefficient 
of relatedness between sweet orange and other 
accessions, with C. maxima (rp) and C. reticulata 
(rw) components in green and light salmon, 
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includes references to citrus cultivated during the reign of Emperor Ta 
Yu (2205-2197 Bc). Sixteen of the twenty-eight mandarins belong to 
type-2 mandarins, which have a small amount of pummelo admixture 
(1-10% of the length of the genetic map; Fig. 2a), usually in the form 
of a few short segments distributed across the genome. Although the 
lengths and locations of these admixed segments may be distinct in 
different mandarins, they share one or two common pummelo haplo- 
types (designated as P1 and P2) (Extended Data Fig. 3). By contrast, 
the seven remaining mandarins (type-3) contain higher proportions of 
pummelo alleles (12-38%; Fig. 2a) in longer segments. Although the P1 
and P2 pummelo haplotypes are also detectable among type-3 manda- 
rins, other more extensive pummelo haplotypes dominate the pummelo 
admixture in type-3 mandarins (Fig. 2b and Extended Data Table 2). 
These observations suggest that the initial pummelo introgression 
into the mandarin gene pool may have involved as few as one pummelo 
tree (carrying both P1 and P2 haplotypes), the contribution of which 
was diluted by repeated backcrosses with mandarins (Supplementary 
Note 6.3). The introgressed pummelo haplotypes became widespread 
and gave rise to type-2 (early-admixture) mandarins (Fig. 2b). We pro- 
pose that later, additional pummelo introgressions gave rise to type-3 
(late-admixture) mandarins and sweet orange, and that some modern 
type-3 mandarins were derived from hybridizations among existing 
mandarins and sweet orange. This late-admixture model for type-3 
mandarins is consistent with the historical records for Clementine and 
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Kiyomi (both mandarin-sweet orange hybrids), and for W. Murcott, 
Wilking and Fallglo (hybrids involving other type-3 mandarins), 
whereas definitive records for the remaining two late-admixture man- 
darins (King and Satsuma) are not available. 


Domestication of mandarins and sweet orange 

Citrus domestication probably began with the identification and 
asexual propagation of selected, possibly hybrid or admixed individ- 
uals, rather than recurrent selection from a breeding population as 
for annual crops*”8, Additional diversity was obtained by capturing 
somatic mutations that occur within a relatively few basic genotypes. 
Therefore, conventional approaches to identifying selective pressures 
under recurrent breeding*® cannot be applied. We can, however, 
use genome sequences to infer some features of the early stages of 
citrus domestication. Here we focus on mandarins, a class of citrus 
comprising small and easily peeled fruits that are of high commercial 
value. 

All 28 mandarin accessions, except for Tachibana, exhibit an exten- 
sive network of relatedness (with a coefficient of relatedness, r > 1/8), 
and all but four mandarins (three of the four are pure or type-1 
mandarins) show second degree or higher relatedness (r > 1/4) to 
at least one (mean = 7) other mandarin (Fig. 3a and Supplementary 
Note 7). By contrast, sequenced pummelos and citrons appear to 
be independent selections from relatively large populations. In the 
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absence of historical records for most mandarins, the actual kinships 
are difficult to infer, owing to extensive haplotype sharing among 
the ancestors, although some parent-child pairs can be identified. 
In addition to confirming, using the whole-genome sequence, the 
parentage of Wilking (King-Willowleaf), Kiyomi (Satsuma-sweet 
orange) and Fallglo (one grandparent is Clementine), we find parent- 
child relationships between two pairs of mandarins (Ponkan is a 
parent of Dancy; Huanglingmiao (a somatic mutant of Kishu) is 
a parent of Satsuma)**, in addition to the previously established 
parent-child pair of Willowleaf and Clementine mandarins’. 
Additional parent-child pairs involving the recently sequenced 
Chinese mandarins? are also identified (Supplementary Note 7.3). 
A few cultivar types in this network (Satsuma, Dancy, Clementine, 
Kiyomi, Fallgo and the Chinese cultivar BT] mandarins) have marked 
signs of inbreeding, indicated by runs of homozygosity (Extended 
Data Fig. 4a) as a result of shared haplotypes between their parents. 
The high degree of relatedness among mandarins implies extensive 
sharing of C. reticulata haplotypes. 

Sweet orange also shows extensive haplotype sharing at the level of 
r>0.1 with 25 of the 28 sequenced mandarins (except for three pure 
or type-1 mandarins; Fig. 3b and Extended Data Fig. 4b). Two late- 
admixture mandarins (Clementine and Kiyomi) are direct offspring 
of sweet orange. Among the early-admixture (type-2) mandarins, 
Ponkan shows the highest affinity to sweet orange” with r~ 0.36. Even 
the pure mandarin, Sun Chu Sha Kat has r= 0.23, equivalent to second 
degree relatedness to sweet orange. We can rule out the scenario that 
sweet orange is the common ancestor of the mandarins, because of a 
lack of pummelo haplotypes (derived from sweet orange) among the 
mandarins. Rather, the extensive C. reticulata haplotype sharing 
between sweet orange and mandarins suggests that the mandarin 
parent of sweet orange was part of an expansive network of relatedness 
among mandarins. 

Because our collection of mandarins represents a diverse set of 
both ancient and modern varieties, including economically impor- 
tant accessions with mostly unknown parentage, the presence of an 
extensive relatedness network was not anticipated a priori. The shared 
C. reticulata haplotypes are suggestive of and consistent with signatures 
of the human selection process, during which mandarins with desirable 
traits were necessarily maintained through clonal propagation (nucellar 
polyembryony or grafting). Although one cannot preclude the possi- 
bility that the relatedness network was initiated before domestication 
from a small number of founder trees, human selection of accessions 
resulting from natural hybridization probably had a key role in the 
process of domestication that eventually led to the extensive related- 
ness network observed today. For example, modern mandarins, such 
as Clementine and W. Murcott, are known to be selections from chance 
seedlings found in Algeria*® and Morocco’, at the onset and middle of 
the last century, respectively. 

Pummelo admixture is correlated with fruit size and acidity, suggest- 
ing a role for pummelo introgression in citrus domestication. As both 
fruit size and acidity profile for the most recently sequenced accessions* 
are not described, we used 37 citrus accessions in this analysis. We 
find that the fruit sizes of mandarins, oranges, grapefruit and pum- 
melos show a strong positive correlation (Pearson correlation coeffi- 
cient r=0.88) with pummelo admixture proportion (Extended Data 
Fig. 5a, b and Supplementary Note 10.1). In addition to fruit size, a 
pivotal driver of fruit domestication is palatability, a characteristic 
that in citrus requires low to moderate levels of acidity. In mandarins, 
palatability appears to be linked to pummelo introgression at a major 
locus at the start of chromosome 8 (0.3—2.2 Mb), where all nine known 
palatable mandarins, but none of the four known acidic mandarins, 
show pummelo admixture in at least part of the genomic region 
(Extended Data Fig. 3). This locus is also found to be significant in a 
genome scan for palatability association (Extended Data Fig. 5c, d and 
Extended Data Table 3) and contains several potentially relevant genes 
(Supplementary Note 10.2). Among these genes is a gene encoding 
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the mitochondrial NAD*-dependent isocitrate dehydrogenase (IDH) 
which regulates citric-acid synthesis*! (Extended Data Table 4). 

Our study finds that domesticated citrus fruit crops, such as man- 
darins and sweet orange, experienced a complex history of admix- 
ture, conceptually similar to those well-recognized in annual crops, 
such as rice** and maize*, and in other fruit trees, such as apple“* 
and grape”, for which the current genomic diversity is linked to 
widespread ancient introgression. Other cultivated citrus groups, 
the interspecific F1 hybrids in particular, originated from hybridiza- 
tions of two pure parental species. Several of these involve C. medica 
(citron), including limes and lemons!°. A unique and critical char- 
acteristic of the three pivotal species (C. maxima, C. reticulata and 
C. medica) that gave rise to most cultivated citrus fruits is the occur- 
rence of a complex floral anatomy (Extended Data Fig. 6), thus lead- 
ing to the development of more complex fruit. Other species were also 
involved in hybridizations, including Fortunella and C. micrantha. 
Distinct from the mandarin lineages, these hybrids are characterized 
by their acidic fruit, and their selection must have been made on the 
basis of other characteristics, such as a sweet edible peel and aroma’, 
respectively. 


Conclusion 

On the basis of genomic, phylogenetic and biogeographic analyses of 
60 diverse citrus and related accessions, we propose that the centre of 
origin of citrus species was the southeast foothills of the Himalayas, in 
a region that includes the eastern area of Assam, northern Myanmar 
and western Yunnan. Our analyses suggest that the ancestral citrus 
species underwent a sudden speciation event during the late Miocene. 
This radiation coincided with a pronounced transition from wet mon- 
soon conditions to a drier climate, as observed in nearby areas in 
many other plant and animal lineages. The Australian citrus species 
and Tachibana, a native Japanese mandarin, split later from main- 
land citrus during the early Pliocene and Pleistocene, respectively. 
By distinguishing between pure species, hybrids and admixtures, 
we could trace the genealogy and genetic origin of the major cit- 
rus commercial cultivars. Both the extensive relatedness network 
among mandarins and sweet orange, and the association of pummelo 
admixture with desirable fruit traits suggest a complex domestication 
process. 

Our work challenges previous proposals for citrus taxonomy. For 
example, we find that several named genera (Fortunella, Eremocitrus 
and Microcitrus) are in fact nested within the citrus clade. These and 
other distinct clades that we have identified are therefore more appro- 
priately considered species within the genus Citrus, on a par with those 
that formerly were referred to as the three ‘true’ or ‘biological’ species 
(C. reticulata, C. maxima and C. medica). Additionally, the related genus, 
Poncirus, a subject of continuous controversy since it was originally 
proposed to be within the genus Citrus'*®, is clearly a distinct clade 
that is separate from Citrus based on sequence divergence and whole- 
genome phylogeny. 

In summary, this work presents insights into the origin, evolution 
and domestication of citrus, and the genealogy of the most important 
wild and cultivated varieties. Taken together, these findings draw a new 
evolutionary framework for these fruit crops, a scenario that challenges 
current taxonomic and phylogenetic thoughts, and points towards a 
reformulation of the genus Citrus. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Sample collection and sequencing. Whole-genome sequences from a total of 
60 accessions were analysed: 58 citrus accessions with different geographical ori- 
gins and two representative outgroup genera. Twelve of these genomes, including 
five mandarins, four pummelos, two oranges and a wild Mangshan mandarin 
(C. mangshanensis) were reanalysed from previous works”’. We also reanalysed 
19 genomes from Chinese collections, including 15 unnamed mandarins, 
2 Chinese sour oranges, Ambersweet orange and Cocktail grapefruit (a hybrid 
resembling grapefruit) that have been previously reported’. 

The 30 accessions that were newly sequenced came from citrus germ-plasm 
banks and collections at IVIA, Valencia, Spain; SRA, Corse, France; UCR, Riverside 
and FDACS/DPI, Florida and included nine mandarins, two limes, one rough 
lemon, one grapefruit, one lemon, four citrons, one Australian desert lime, one 
eremorange, two Australian finger limes, two Australian round limes, one kum- 
quat, one calamondin, one micrantha, one Ichang papeda, one trifoliate orange 
and one Chinese box orange (Supplementary Note 1). 

DNA libraries were constructed using standard protocols with some modifica- 

tions. Library insert sizes range from 325 to 500 bp. Sequencing was performed on 
HiSeq2000/2500 instruments using 100-bp paired-end reads. Primary analysis of 
the data included quality control on the Illumina RTA sequence analysis pipeline 
(Supplementary Note 2). 
Variant calls and Citrus species diversity. Illumina paired-end reads were aligned 
to the haploid Clementine reference sequence? and the sweet orange chloroplast 
genome assembly*’ using bwa-mem'*. PCR duplicates were removed using Picard. 
Raw variants were called using GATK HaplotypeCaller” with subsequent filter- 
ing based on read map quality score, base quality score, read depth and so on 
(Supplementary Note 3.1). 

Interspecific admixtures versus pure citrus species were distinguished based 
on sliding window analysis of heterozygosity and pairwise genetic distance D 
(Supplementary Note 4). Genome-wide ancestry informative markers for the 
progenitor species were derived using pure accessions. Admixture analysis was 
carried out in sliding windows using ancestry informative markers (Supplementary 
Notes 5). 

Citrus relatedness and haplotype sharing. Interspecific phasing was used to 
extract admixed haplotypes. Identical-by-descent sharing was calculated for each 
of the non-overlapping sliding windows across the genome and used to estimate 
coefficient of relatedness among citrus accessions (Supplementary Notes 6, 7). 

Phylogeny and speciation dating. We used Chinese box orange (genus Severinia) 
as an outgroup. Time calibration is based on the C. linczangensis'® fossil from 
Lincang, Yunnan, China. MrBayes”? was used for whole genome Bayesian 
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phylogenetic inference, and corroborated with a PhyML” reconstructed maximum 
likelihood tree. A penalized likelihood method™ as implemented in APE* was 
used to construct the chronogram (Supplementary Note 8). 

Genome scan of palatability association. We used a mixed linear model as imple- 
mented in gemma™ for a case-control study of citrus acidity and palatability with 
37 citrus accessions. A conservative Bonferroni correction was used to select 
significant genomic loci, with subsequent manual examination of each candidate 
variant in all accessions to identify most discriminatory loci for fruit palatability 
(Supplementary Note 10). 

Data availability. Whole-genome shotgun-sequencing data generated in this 
study have been deposited at NCBI under BioProject PRJNA414519. Prior rese- 
quencing data analysed here can be accessed under BioProject accession numbers 
PRJNA320985 (mandarins) and PRJNA321100 (oranges), and also under the NCBI 
Sequence Read Archive accession codes SRX372786 (sour orange), SRX372703 
(sweet orange), SRX372702 (low-acid pummelo), SRX372688 (Chandler 
pummelo), SRX372685 (Willowleaf mandarin), SRX372687 (W. Murcott manda- 
rin), SRX372665 (Ponkan mandarin) and SRX371962 (Clementine mandarin). 
The Clementine reference sequence used here is available at https://phytozome. 
jgi.doe.gov/. 
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Extended Data Figure 1 | Citrus biogeography and phylogeny. 

a, Geographical distribution of the genus Citrus in southeast Asia 

and Australia. Distribution is based on documented reports on the 
presence of wild genotypes representative of pure citrus species (blue 
numbers), admixtures (red numbers) and relevant interspecific hybrids 
(black numbers), growing freely in non-cultivated areas. Numbers are 

as in Supplementary Table 1. 1, 2, Fortunella spp. ; 3, Australian citrus 

(E. glauca; Microcitrus australasica; Microcitrus australis); 4, C. micrantha; 
5-8, C. ichangensis; 9-15, C. medica; 16-19, C. maxima; 20-22, C. reticulata 
(Sun Chu Sha Kat); 23-25, C. tachibana; 26, C. mangshanensis; 27-29, 
Citrus spp. (mandarins); 30-33, C. sinensis; 34, 35, Citrus limon (probably 
not truly wild genotypes); 36, 37, Citrus limonia; 38, Citrus jambhiri; 
39-42, C. aurantium; 43, Citrus aurantifolia (probably not truly wild 
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genotypes); 44, 45, Fortunella and C. reticulata hybrid. The red star 
indicates the location of the C. linczangensis fossil from the Late 
Miocene of Lincang'®. b, Citrus chloroplast genome phylogeny rooted 
with Severinia. The analysis was performed on 58 citrus accessions and 
2 outgroup genera, Poncirus and Severinia. The maximum likelihood 
tree as inferred from PhyML is shown. Percentage statistical support for 
the nodes is based on 200 bootstrap replicates. c, Citrus nuclear genome 
phylogeny rooted with Severinia. Both Bayesian and maximum likelihood 
trees yield the same topology with highly supported branches. The 
maximum likelihood tree reconstructed from PhyML is shown. Branch 
statistical support is based on 1,000 bootstraps and is shown if it is less 
than 100%. All branches have posterior probability 1.0 with Bayesian 


inference using MrBayes (not shown). 
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Extended Data Figure 2 | Segmental ancestry and admixture in citrus. pummelo admixture pattern. b, Segmental ancestry of an Australian finger 
a, Segmental ancestry of 46 citrus accessions derived from the three lime. Blue segments denote pure finger lime (genotype: F/F), and orange 
progenitor species of C. medica (CI), C. maxima (PU) and C. reticulata segments have Australian round lime admixture (genotype: F/R). Genomic 
(MA). UNK, unknown. Mandarins are divided into three types with regions are coloured in grey if segmental ancestry cannot be determined. 
type-1 representing pure mandarins. Types 2 and 3 are determined by the 
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Extended Data Figure 3 | Pattern of pummelo introgression in type-2 mandarins are in green. Left, haplotype tree for chr3:3.2-5.2 Mb. 
mandarins. a, Distinct admixed pummelo haplotypes among mandarins, Sweet orange, sour orange, and twelve of the sequenced mandarins are 
oranges and grapefruit are shown in different colours; the C. reticulata interspecific hybrids, and their phased C. maxima and C. reticulata 
haplotypes are masked. The admixture pattern separates the mandarins haplotypes are denoted by prepending, respectively, ‘p’ and ‘m’ to the 
into three groups, with type-1 representing pure mandarins. Type-2 corresponding accession codes. The nine type-2 mandarins share the same 
mandarins contain a small amount of pummelo admixture derived from pummelo haplotype (P1). Right, the haplotype tree for chr2:31.4-33.4 Mb. 
two C. maxima haplotypes: P1 (light blue colour) and P2 (dark blue), Two pummelo haplotypes (P1, P2) are shared among seven type-2 
suggesting as few as one common pummelo ancestor in the distant past. mandarins, with Ponkan mandarin containing both P1 and P2. Sweet 
Type-3 mandarins are characterized by both marked pummelo admixture orange also carries two pummelo haplotypes at this locus, denoted by 
and additional pummelo haplotypes besides P1 and P2. b, Haplotype pC.SWO (shared with Clementine) and pA.SWO (alternate haplotype). 


trees for two chromosome segments where pummelo haplotypes of 
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by Pc) if its genetic ancestry is C. reticulata or C. maxima, respectively. 
Similarly, hapA can take the form of C. reticulata (M, in orange colour) 
or C. maxima (Pa in light green) depending on its genetic ancestry. Shared 
sweet orange haplotypes in mandarins are coloured accordingly, except 
when both haplotypes of sweet orange are shared (IBD2) either as two 

C. reticulata haplotypes (Mac, dark red) or as interspecific hybrid (P + M, 
blue). Regions coloured in grey denote the absence of haplotype sharing 
between sweet orange and mandarin. 


Extended Data Figure 4 | Haplotype sharing in mandarins. a, Runs of 
homozygosity in mandarins. Heterozygosity is plotted in non-overlapping 
windows of 200 kb along the nine chromosomes of 7 mandarin accessions 
with the highest degree of inbreeding. Runs of homozygosity correspond 
to regions with zero heterozygosity as a result of haplotype sharing 
between the parents. b, Haplotype sharing between sweet orange and 
mandarins. The two haplotypes of sweet orange are denoted by hapC 
(transmitted to Clementine) and hapA (alternate), respectively. The hapC 
haplotype is coloured in red (denoted by Mc) or dark green (denoted 
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Extended Data Figure 5 | See next page for caption. 
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Extended Data Figure 5 | Fruit size and acidity correlated with 
pummelo introgression. a, Fruit size strongly correlated with pummelo 
admixture. The diameters of mandarins, oranges and grapefruit are 
plotted against the corresponding pummelo admixture proportions. 

A simple linear regression is shown in red. The strong correlation 
(Pearson correlation coefficient r= 0.88) between fruit size and pummelo 
admixture is apparent, especially among the three taxonomic groups 

of mandarins, oranges and grapefruit. The outliers (Ponkan mandarin 
and four acidic mandarins) suggest that certain genomic loci could be 
more important than others in fruit size determination. Accessions 
without size information are not included. Data are mean ts.d. from 

a set of 25 measurements for each of the 15 accessions. b, Fruit size 
correlation with pummelo allelic proportion with the addition of two 
pummelos. A polynomial regression provides a better fit than simple 
linear regression (adjusted R* = 0.92). Data are mean +s.d. from a set 
of 25 measurements for each of the 17 accessions. c, Genome scan of 


ARTICLE 


significant loci associated with citrus acidity. Manhattan plot of a case- 
control analysis of a genome-wide association study (GWAS) of n = 37 
citrus accessions with known acidity profile. The horizontal dashed line 
denotes the conservative Bonferoni-corrected P value of 7.9 x 10-8 for 
genome-wide significance (a= 0.05). d, Manual inspection of candidate 
regions identified by GWAS (n= 37 accessions) demonstrates that in 
addition to the locus at chr1:23512067, single-nucleotide polymorphisms 
located in chromosome 8 are discriminatory for acidity. Shown are 40-kb 
zoom windows containing focal single-nucleotide polymorphisms (red 
vertical lines and green arrows) and gene models depicted by blue boxes 
in these two regions. TOE1, target of EGR1 protein 1 (Ciclev10007611; 
https://phytozome.jgi.doe.gov/pz/portal.html); NAD+-IDH, NAD*- 
dependent isocitrate dehydrogenase (IDH, Ciclev10028714); GH10, 
glycosyl hydrolase family 10 protein (Ciclev10028121) and MTOR, serine/ 
threonine protein kinase (Ciclev10027661). 
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Extended Data Figure 6 | Species characteristics of citrus. Reproductive images are not drawn to scale. Most mandarins in our collection are 
and vegetative characteristics among several species of the genus polyembryonic, though a few are monoembryonic, including Clementine. 
Citrus and related genera according to refs 1 and 55. The tree topology Other exceptions to the generalized description concerning embryo 
represented is that of the chronogram shown in Fig. 1c and citrus fruit numbers, in kumquat and other citrus species, can also be found. 
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Extended Data Table 1 | Citrus interspecies divergence and intraspecies diversity 
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Average pairwise sequence divergences between ten citrus species and two out groups (Poncirus and Severinia) are listed, in unit of 10-3. Each citrus species is represented by one diploid genotype 
free from interspecific admixture. Intraspecies variation is measured by the nucleotide diversity (that is, mean sequence divergence between the two haploid sequences of a diploid), and is represented 
along the diagonal in units of 10-3. The species names and the codes of the representative accessions are given in the last column and first row, respectively. Note the wide separation between 


interspecies divergence and intraspecies variation. See Supplementary Table 2 for details and definitions. 
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Extended Data Table 2 | Admixture proportions of 50 citrus accessions derived from five progenitor species 


Code PU/PU PU/MA MA/MA UNK 
Type-1 mandarins TBM,SCM,M01,M02,M04 0 0 1 0 
CLP 0.000 0.019 0.977 0.004 
SNK 0.000 0.025 0.975 0.000 
M17 0.000 0.045 0.949 0.006 
M12 0.000 0.047 0.944 0.009 
M03 0.000 0.055 0.937 0.007 
HLM 0.008 0.043 0.928 0.021 
M10 0.000 0.063 0.934 0.003 
Tye-2inandaine M15 0.000 0.080 0.915 0.005 
CSM 0.000 0.081 0.901 0.017 
M16 0.000 0.082 0.911 0.007 
M11 0.000 0.088 0.894 0.018 
DNC 0.000 0.114 0.882 0.004 
WLM 0.000 0.118 0.870 0.012 
M14 0.000 0.120 0.868 0.013 
PKM 0.010 0.124 0.863 0.003 
M08 0.015 0.173 0.804 0.008 
CLM 0.000 0.237 0.737 0.027 
WMM 0.000 0.252 0.732 0.016 
M19 0.031 0.214 0.747 0.008 
Type-3 mandarins M21 0.015 0.281 0.672 0.032 
UNS 0.018 0.411 0.563 0.008 
KNG 0.000 0.471 O:S19 0.009 
M20 0.099 0.565 0.327 0.009 
Sweet orange SwO 0.044 0.743 0.200 0.013 
Ambersweet or. SO5 0.104 0.543 0.318 0.035 
en $SO,BO2 0.000 1.000 0.000 0.000 
BO3 0.046 0.942 0.000 0.012 
Grapefruit PAR 0.328 0.670 0.000 0.002 
Cocktail grapef. GFO 0.067 0.933 0.000 0.000 
Pareto CHP 0.990 0.005 0.000 0.005 
LAP,GXP,STP 1 0 0 0 
Code CI/Cl CI/PU CI/MA UNK 
Rangpur lime LMA 0 0 1 0 
Rough lemon RRL 0 0 1 0 
Lemon LIM 0 0.364 0.620 0.016 
Citrons BUD,COR,HUM,VEU 1 0 0 0 
Code CI/Cl CI/MC MC/MC. UNK 
Micrantha MIC 0 0 1 0 
Mexican lime MXL 0 1 0 0 
Code MA/MA MA/FO FO/FO UNK 
Kumquat FOR 0 0 1 0 
Calamondin CAL 0 1 0 0 


These five species are C. medica (Cl), C. maxima (PU), C. reticulata (MA), C. micrantha (MC), Fortunella (FO). Estimates based on genetic map lengths. UNK, unknown. 
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Extended Data Table 3 | Alleles of candidate single-nucleotide polymorphisms associated with citrus palatability 


Code | 1:415175 | 1:23512067 | 1:23679916 | 1:24219222 | 2:15484525 | 2:15702160 | 5:35094706 | 5:35098538 | 8:325527 | 8.631678 | 8.927020 | 8:1149577 | 8:1149586 | 81174414 | 8:1413967 | 8:1651338 | 81655701 | 8:1722788 | 8:2058824 | 8:2060290 | 8:2063416 | 8:2137063 | 8:2174360 | 9:30789594 
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The association study is based on a case-control GWAS analysis of 37 accessions with known palatability. Grey, ancestral alleles detected in Severinia and Poncirus; red, derived alleles; yellow, 
heterozygous single-nucleotide polymorphisms. 
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Extended Data Table 4 | The DH gene variants 


SNP position 
Exon 1 Exon 4 Exon 4 
Species Cultivar Code 8:324328 8:326594 8:326608 


Citrus maxima ~ Chandler pummelo CHP ‘T/T GG cc 


Citrus maxima Low-acid pummelo/ LAP TIT G/G CIC 
Citrus maxima Guanxi pummelo GXP T/T G/G CIC 
Citrus maxima Shatian pummelo STP T/T G/G CIC 
Citrus sinensis Sweet orange SWO T/T G/G CIC 
Citrus paradisi Marsh grapefruit PAR T/T G/G CIC 
Citrus reticulata Ponkan mandarin PKM T/T G/T C/G 
Citrus clementina Clementine mandarin CLM T/T G/T C/G 
Citrus tangerina Dancy mandarin DNC T/T G/T C/G 
Citrus deliciosa Willowleaf mandarin WLM TIT G/T C/G 
Citrus unshiu Satsuma mandarin UNS TIT GIA C/G 
Citrus reticulata Huanglingmiao mandarin HLM T/T GIA C/G 
Citrus nobilis King mandarin KNG T/G GIA C/G 
Citrus reticulata Changsha mandarin CSM T/T AIT G/G 


Citrus reticulata W. Murcott mandarin WMM T/G GIA C/G 


Citrus aurantium Sour orange SSO. TG GIA CIG 


Citrus sunki Sunki mandarin SNK TIT AIT G/G 
Citrus reshni Cleopatra mandarin CLP T/G AIA G/G 
Citrus ichangesis Ichang papeda ICH T/G AIA G/G 
Citrus limon Eureka lemon LIM TIT AIA G/G 
Citrus micrantha Micrantha MIC AIA AIA G/G 
Citrus aurantifolia Mexican lime MXL AIA AJA G/G 
Fortunella margarita Kumquat. Nagami FOR TIA AIA G/G 
Citrus limonia Rangpur lime LMA TIA AIA G/G 
Citrus madurensis Calamondin CAL TIA AIA G/G 
Citrus medica Mac Veu Montain Citron VEU TIT AJA G/G 
Citrus medica Corsican citron COR TIT AIA G/G 
Citrus medica Buddha’s hand citron BUD TAL AJA G/G 
Citrus medica Humpang citron HUM TIT AJA G/G 
Microcitrus australasica Australian finger lime (BC2) = AFL G/G AIA G/G 
Microcitrus australis Australian round lime (Pure) ARL G/G AIA G/G 
Microcitrus australasica Australian finger lime (Pure) AFR G/G AIA G/G 
Microcitrus australis Australian round lime (Pure) ARR G/G AJA GIG 
Eremocitrus glauca Australian desert line ADR G/G AIA GIG 
Citrus jambhiri Red rough lemon RRL TIT AIT GIG 
Citrus reticulata Sun Chu Sha Kat mandarin SCM T/T TIT G/G 
Citrus tachibana Tachibana mandarin TBM G/G AIA G/G 


Alleles of non-synonymous single-nucleotide polymorphisms of the NAD*t-dependent isocitrate dehydrogenase (/DH) gene (Ciclev10028714) in 37 citrus accessions with known palatability. 
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Ketamine blocks bursting in the lateral 
habenula to rapidly relieve depression 


Yan Yang!?*, Yihui Cui!**, Kangning Sang!*, Yiyan Dong!*, Zheyi Ni!, Shuangshuang Ma! & Hailan Hu!? 


The N-methyl-D-aspartate receptor (NMDAR) antagonist ketamine has attracted enormous interest in mental health 
research owing to its rapid antidepressant actions, but its mechanism of action has remained elusive. Here we show that 
blockade of NUDAR-dependent bursting activity in the ‘anti-reward center’, the lateral habenula (LHb), mediates the 
rapid antidepressant actions of ketamine in rat and mouse models of depression. LHb neurons show a significant increase 
in burst activity and theta-band synchronization in depressive-like animals, which is reversed by ketamine. Burst- 
evoking photostimulation of LHb drives behavioural despair and anhedonia. Pharmacology and modelling experiments 
reveal that LHb bursting requires both NMDARs and low-voltage-sensitive T-type calcium channels (T-VSCCs). 
Furthermore, local blockade of NMDAR or T-VSCCs in the LHb is sufficient to induce rapid antidepressant effects. Our 
results suggest a simple model whereby ketamine quickly elevates mood by blocking NMDAR-dependent bursting activity 
of LHb neurons to disinhibit downstream monoaminergic reward centres, and provide a framework for developing new 


rapid-acting antidepressants. 


The discovery of the rapid antidepressant effects of the NUDAR 
antagonist ketamine is arguably the most important advance in the 
field of psychiatry in the past half century’. A single administration 
of ketamine elicits fast (in as little as half an hour) and sustained 
antidepressant effects both in humans!” and in animal models of 
depression*®, It also has a fast metabolic turnover rate, with a half-life 
of three hours’. This rapid ‘hit-and-go’ temporal profile suggests that 
ketamine is likely to act on a system that has ongoing activity with 
open NMDAR channels’. 

The LHb has recently been implicated in the coding of negative 
emotion?! and the pathophysiology of major depression!*"!*. The 
LHb hosts primarily glutamatergic neurons”, but it inhibits the brain’s 
reward centres (including the dopaminergic ventral tegmental area 
(VTA) and the serotonergic dorsal raphe nucleus (DRN), hypoactivity 
of which has been implicated in depression”®”') either through a relay at 
the \-aminobutyric acid (GABA )ergic rostromedial tegmental nucleus 
(RMT¢g) or via local interneurons within the VTA and DRN Pideeenee 
Although there is accumulating evidence that aberrant overactivity in 
the LHb is crucial to depression!*"'¥, it remains unknown how the spike 
patterns of LHb neurons are altered during depression and what role 
this may play in the aetiology of depression and the fast antidepressant 
effects of ketamine. 


Blockade of LHb NMDARs is antidepressant 

We found that congenitally learned helpless (cLH) rats’*"!’, which 
model several aspects of depression, showed an antidepressant response 
to systemic injection of ketamine (25 mg kg!) in the forced swim test 
(FST, Extended Data Fig. la). To assess whether ketamine exerts its 
antidepressant effects through the LHb, we performed bilateral infusion 
of ketamine into the LHb through dual-guide cannulae (Fig. 1a). Local 
infusion of ketamine (25 1g, 1 ul each side, Fig. 1b) into the LHb of cLH 
rats was sufficient to quickly rescue the depression-like behaviours, 
including behavioural despair (as measured by immobile time in the 
FST; Fig. 1c) and anhedonia (as measured by the sucrose preference test 
(SPT); Fig. 1d), one hour after infusion. These antidepressant effects 


were dosage-dependent (Fig. 1c, d). Liquid chromatography-tandem 
mass spectrometry (LC-MS/MS, Extended Data Fig. le-g) revealed 
that infusion of 25 1g ketamine through cannulae into the LHb resulted 
ina local ketamine concentration of 5.2 + 1.3 1M, in a similar range 
to that caused by intraperitoneal injection of 25 mg kg~' ketamine 
(11.0 + 1.7 LM, Fig. le). 

Local bilateral infusion of the specific NMUDAR antagonist 2-amino- 
5-phosphonopentanoic acid (AP5; 40 nmol, Fig. 1f) into the LHb 
effectively reduced the immobile time in the FST (Fig. 1g) and 
increased the sucrose preference in the SPT (Fig. 1h), similar 
to the effects of ketamine. Neither ketamine nor AP5 infusion 
changed general locomotion, as measured by the open field test 
(OFT; Extended Data Fig. 1b, c). Notably, infusion of the AMPAR 
(a-amino-3-hydroxy-5-methyl-4-isoxazole propionic acid receptor) 
antagonist NBQX (1,2,3,4-tetrahydro-6-nitro-2,3-dioxo-benzo[f] 
quinoxaline-7-sulfonamide; 1 nmol or 5 nmol, Fig. 1i) into the LHb 
did not cause significant antidepressant effects (Fig. 1)). 


Increased LHb burst firing in depression 

To investigate the activity patterns of LHb neurons, we performed 
whole-cell patch-clamp on neurons in LHb coronal slices and recorded 
spontaneous neuronal activity under current clamp at resting conditions 
(I=0 pA). As previously shown*>”®, LHb neurons were intrinsically 
active and fell into three categories: silent, tonic-firing and burst-firing 
(Fig. 2a—c). Tetrodotoxin (TTX; 1|1M) abolished action potentials but 
left a plateau potential in the bursting neurons (Fig. 2c), indicating that 
the ionic component of this plateau potential is not sodium, but might be 
calcium”. The three classes of neurons were distributed among different 
sub-nuclei of the LHb with no clear regional enrichment (Fig. 2f). The 
resting membrane potentials (RMPs) of LHb neurons were on average 
more depolarized than those in the hippocampus or cortex”® (Fig. 2d, e). 
Notably, bursting neurons had significantly more hyperpolarized RMPs 
than silent or tonic-firing neurons (Fig. 2d, e). Moreover, the frequen- 
cies of the intra-burst spikes, but not of the inter-burst spikes, correlated 
with the degree of hyperpolarization (Extended Data Fig. 2c, d). 
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Figure 1 | Local blockade of NMDARs in LHb is sufficient to elicit rapid 
antidepressant effects. a, Illustration of bilateral implantation of cannulae 
in the LHb of cLH rats. CTX, cortex; HPC, hippocampus. Bottom: white 
dashed lines indicate location of habenula. b-d, Rapid antidepressant 
effects of local bilateral infusion of ketamine into the LHb (b, 25, 2.5 or 
0.25 1g, 1 yl each side) in the FST (c) and the SPT (d). e, Concentration 

of ketamine in habenula (Hab) or ventral hippocampus (vHip) 1h after 
intraperitoneal (i.p.) injection or intra-LHb infusion of ketamine, as 
measured by LC-MS/MS. f-j, Acute effects of local bilateral infusion of 
AP5 (40 nmol, 1 ul each side, f-h) or NBQX (1 or 5nmol, 1 pl each side, 

i, j) into the LHb on the FST (g, j) and SPT (h). Infusion sites of drugs 
verified by CTB (cholera toxin subunit B) (b, f, i). Data are mean + s.e.m. 
*P<0.05, **P < 0.01, NS, not significant. See Supplementary Table 1 for 
statistical analysis and n numbers. 


Although basic membrane properties, such as input resistance, and 
I-V relationships, and the average duration of bursts were indistin- 
guishable between neurons from cLH and wild-type rats (Extended 
Data Fig. 2), the percentage of bursting neurons increased significantly 
from 7% in wild-type controls to 23% in cLH rats (P= 0.003, Fig. 2g). 
The percentage of spikes in the bursting mode also increased from 7% 
in control to 43% in cLH rats (Fig. 2h). The inter-spike intervals (ISIs) 
of LHb neurons in cLH rats exhibited a clear bimodal distribution with 
an extra sharp and condensed cluster of high frequency events centred 
around 14 ms (corresponding to about 71 Hz), indicating a significant 
weight increase of burst firings (Fig. 2i). 

To test whether enhanced bursting is universal in depression-like 
states, we used a second animal model of depression, mice with chronic 
restraint stress (CRS) (Extended Data Fig. 3). Patch-clamp recording of 
LHb neurons from these mice revealed similar phenomena: namely, a 
marked increase in both the percentage of bursting cells and the per- 
centage of spikes in bursting mode (Extended Data Fig. 3g-i), and 
an extra peak at 20 ms in the ISI distribution (Extended Data Fig. 3)). 
Notably, acute systemic injection of ketamine at antidepressant dosages 
reduced bursting back to control levels in both cLH rats (Fig. 2g, h) and 
CRS mice (Extended Data Fig. 3h, i). 

To test whether bursting also occurs in vivo in the LHb and whether 
it is bidirectionally modulated by depression-like state and ketamine, 
we performed in vivo multi-tetrode recording in the LHb of freely 
behaving mice (Fig. 2j, Extended Data Fig. 4). LHb neurons from CRS 
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mice showed a notable increase in bursting activity (Fig. 2k, 1) but not 
tonic firing (Extended Data Fig. 4f), compared with neurons from naive 
control mice. As a consequence, the mean firing rate was also higher 
in CRS mice than in control mice (Extended Data Fig. 4f). Injection 
of ketamine (10 mg kg™', intraperitoneal, 1h before recording) sig- 
nificantly suppressed LHb bursting activity (P < 0.0001, Fig. 2m and 
Extended Data Fig. 4i) and overall firing rate (Extended Data Fig. 4h). 
The cumulative frequency distributions of ISIs, which clearly differed 
between CRS and control mice, were significantly shifted towards the 
control level by ketamine (P < 0.0001, Fig. 2n). 

Burst firing is known to increase network synchronization’’. We 
therefore tested whether LHb network synchronization was altered in 
the CRS mice. We calculated the spike-triggered averages (STAs) of 
local field potentials (LFPs), which revealed oscillatory synchronization 
between spikes and LFPs (see Methods). In control mice, the distribution 
of the power spectra of STAs was relatively flat (Extended Data Fig. 4j), 
indicating a lack of synchronization. In CRS mice, there emerged a 
dominant frequency of 7 Hz in the power spectra of STAs (Extended 
Data Fig. 4j), indicating that spikes tended to phase-lock with LFPs 
in the theta-band range. Consistently, CRS mice showed significantly 
higher spike-field coherence (SFC, reflecting the normalized power 
spectra of STAs) in the theta band range compared with control mice 
(P= 0.022, Fig. 20). These changes in LHb network synchronization 
were reversed to control level 1h after intraperitoneal injection of 
ketamine (10mg kg™!, Fig. 20 and Extended Data Fig. 4}). 


LHb bursts require NUDARs 

Given that NUDAR-mediated calcium influx plays a pivotal role in burst 
generation in several brain regions***°, and in light of our finding that 
systematic injection of ketamine suppressed bursting activity (Fig. 2), 
we tested whether NMDARs are directly required for the bursting 
activity in the LHb. NMDAR currents were detected in LHb neurons 
(Extended Data Fig. 5a, b). Bath application of ketamine (100 1.M) 
to LHb brain slices did not change the neuronal RMPs (Extended 
Data Fig. 5c) or miniature excitatory postsynaptic potentials (EPSCs; 
Extended Data Fig. 5h-j), but completely eliminated spontaneous burst 
firing within seconds of application (Fig. 3a). Lowering the concentra- 
tion of ketamine to 101M or 1M, close to the effective antidepres- 
sant concentration (Fig. le), also completely blocked bursting, but with 
a longer delay (Fig. 3b, c). Bath application of the specific NUDAR 
antagonist AP5 (1001M) also stopped burst firing (Fig. 3d). Notably, 
consistent with the modest behavioural effects of cannular infusion 
(Fig. 1)), full blockade of AMPARs with NBQX (101M) reduced bursts 
only moderately (Fig. 3e). Application of the classical specific serotonin 
reuptake inhibitor (SSRI) antidepressant fluoxetine did not affect LHb 
bursts at this rapid time scale (Fig. 3f). Furthermore, perfusion of brain 
slices with NMDA (201M) in artificial cerebrospinal fluid (ACSF) with 
no Mg”* induced strong bursting activity in 10 out of 13 originally 
silent LHb neurons (Fig. 3g, h). Again this bursting activity was blocked 
by additional bath application of ketamine (Fig. 3h). 


LHb bursts depend on RMP and T-VSCCs 

Given the correlation between RMPs and firing modes of LHb 
neurons” (Fig. 2d, e), we next tested whether changing RMPs can alter 
the pattern of spiking activity in the LHb. By applying a transient ramp- 
like current injection that enabled RMPs to change progressively”® 
from around —80 to —40 mV (Extended Data Fig. 6a), we found that 
in 90% (n= 48) of rat and 93% (n = 26) of mouse LHb neurons, the 
hyperpolarization current injection was able to evoke high frequency 
bursts of action potentials. The intra-burst frequencies of the ramp- 
evoked bursts were positively correlated with the hyperpolarization of 
membrane potentials (Extended Data Fig. 6b). The duration of bursts 
tended to decrease with increased hyperpolarization (Extended Data 
Fig. 6c). Consequently, the number of spikes in each burst, which is 
the product of the intra-burst frequency and burst duration, were 
normally distributed from —80 mV to —40mV and peaked at 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


: > F 

Silent € Tonic 
do 
2s 

TTX (1M) 
-55 mV -56 mV -43 mV", 
50 ms 
e Silent ¢ Tonic ¢ Burst g 


SD 


* 


cLH rats 
(n = 102) 


+ Ctrl -Ctrl —CRS —CRS+ket 


=CRS (n = 18) 


Ctrl © 


é 3 
g =aCRS+ket (n = 18) D = 
@ = 8 
g eek woe E 
B100; 150;-— 8 . 

= fe} 
ae c= 7 o Re 
$ = 100 @ os 
£ 60 5 7 8 
Qa om b=} : as 
2 40 a 8 —CRS+ket 26 
S % 50 2 tt +e 
2 20 g E : 28 
e ‘a 6 9 400 B00 
O30 Q Inter-spike interval 
2 (ms) 


Figure 2 | Ketamine suppresses enhanced LHb bursting activity and 
theta-band synchronization in animal models of depression. 

a-c, Representative traces showing spontaneous activity of silent (a), 
tonic-firing (b), and burst-firing (c) LHb neurons recorded with whole- 
cell patch-clamp. Red traces are responses of the same neurons after TTX 
treatment. d, e, Scattered plots (d) and cumulative curves (e) denoting the 
mean and distribution of RMPs. f-i, Burst firing is significantly increased 
in cLH rats, and this increase is reversed by ketamine (i-p., 25 mg kg”). 

f, Recording sites across different subregions of LHb. g, Pie charts 
illustrating the per cent abundance of the three types of LHb neurons. 

h, Bar graph illustrating the percentage of burst- and tonic-type spikes in 
all spikes recorded. i, Histogram of ISI distribution. j, Recording sites of 
each tetrode track in the LHbs of CRS and control mice. k, Example traces 


about —56 to —60 mV (Extended Data Fig. 6d), close to the average 
RMPs observed in spontaneous bursting LHb neurons (Fig. 2d, e). 
This voltage-dependent transition of firing mode also occurred in 
spontaneously spiking LHb neurons: by delivering a hyperpolarizing 
current injection (—10 to —20 pA), 75% (m= 20) of originally 
tonic-firing neurons were transformed to burst-firing (Extended Data 
Fig. 6e); vice versa, a depolarizing current injection (10-20 pA), trans- 
formed 100% (n= 7) of originally bursting neurons to tonic-firing 
mode (Extended Data Fig. 6f). 

The pacemaker channel T-VSCC, unlike other types of voltage- 
sensitive calcium channel, is inactivated quickly after opening at depo- 
larized membrane potentials, but can be de-inactivated to initiate burst 
firing when the membrane potential is hyperpolarized for longer than 
100 ms*!. We detected T-VSCC currents in both rat and mouse LHb 
neurons* (Extended Data Fig. 7a, b). Bath application of the specific 
T-VSCC blocker mibefradil (101M) onto the LHb brain slices effec- 
tively decreased the bursting probability and reduced the amplitude of 
the plateau potentials of spontaneous bursts (Fig. 4a), without signi- 
ficantly changing RMPs (Extended Data Fig. 5f). ZD7288 (50|1M), an 
antagonist of another pacemaker channel (hyperpolarization-activated 
cyclic nucleotide-gated (HCN) channel), had a significant but much 
smaller effect on bursts than mibefradil (P = 0.018, Fig. 4b). 
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(left) and averaged spike waveform (right) of recorded neurons from the 
LHbs of control (top) and CRS mice (middle) and the same CRS unit after 
ketamine injection (bottom). Bursts (grey shades) are identified by the 

ISI method (see Methods). |, m, Per cent of spikes in bursting mode and 
number of bursts per minute of neurons recorded from control mice, CRS 
mice (1), and the same unit in CRS mice 1h before and after ketamine 
injection (m). n, Cumulative distribution of ISIs. Dashed lines indicate 
the 50th percentile of ISI. 0, Spike-field coherence (SFC) of neurons 
recorded from control and CRS mice, and from CRS mice after ketamine 
injection. Left, SFC of each unit; middle, average SFC; right, per cent SFC 
in theta band (4-10 Hz). Data are mean +s.e.m. *P < 0.05, **P< 0.01, 
*EP < 0,001, ****P < 0.0001, NS, not significant. See Supplementary 
Table 1 for statistical analysis and m numbers. 


To understand how NMDARs and T-VSCCs work synergistically 
to mediate LHb burst firing, we constructed a minimal biophysical 
model that successfully recapitulated the key characteristics of the 
LHb bursts, including the voltage dependency and the in silico 
knockout effects of T-VSCCs or NMDAR (Extended Data Fig. 8; see 
Methods). 

Although AMPAR blockade reduced LHb burst probability only 
slightly (Fig. 3e), application of AMPA onto LHb neurons or increased 
AMPAR current increased the burst frequency in electrophysiology 
recording or modelling experiments, respectively (Extended Data 
Figs 8h, 9a). Blockade of GABA receptors (GABARs) also increased 
burst firing (Extended Data Fig. 9b). Thus, burst generation in the 
LHb relies on intrinsic cellular properties and is modulated by network 
synaptic inputs. 

The above results predict that drugs that block T-VSCCs may also 
have rapid antidepressant effects. Indeed, both systematic injection of 
the T- VSCC blocker 2-ethyl-2-methylsuccinimide (ethosuximide), 
which can cross the blood-brain barrier, in CRS mice and bilateral 
infusion of mibefradil (10 nmol) into the LHb of cLH rats caused rapid 
antidepressant effects in both the FST and the SPT (Fig. 4c-g), with- 
out inducing significant changes in total distance travelled in the OFT 
(Extended Data Fig. 1b, d). 
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Figure 3 | LHb bursting requires activation of NMDARs. a, d-f, Example 
traces (left) and statistics (right, sampled within 3 min before and 1 min 
after drug application) showing effects of ketamine (a), AP5 (d), NBQX (e) 
and fluoxetine (f) on spontaneous bursts in the LHb of wild-type and cLH 
rats. Spikes in bursting and tonic-firing mode are shown in blue and black, 
respectively. b, Blockade of spontaneous LHb burst by different dosages of 
ketamine. Bursts per minute are calculated before (—1 to 0 min) and after 
(0-1 min, 3-4 min and 10-15 min) ketamine perfusion. c, Average time 


LHb bursts drive depression 

To test whether increased burst activity is sufficient to induce depres- 
sion-like behaviours, we used optogenetic tools to drive bursting 
activity in the LHb. We first used a variant of channelrhodopsin 2 
(ChR2), oChIEF, which is optimized to follow high-frequency stimu- 
lation. However, LHb neurons expressing oChIEF could not follow 
pulsed 100-Hz light stimulation (Extended Data Fig. 10a). Consistently, 
no depression-like behaviours were induced with this protocol 
(Extended Data Fig. 10b-d). 
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required to fully block LHb bursts by different concentrations of ketamine. 
g, Example trace of an originally silent LHb neuron induced to burst by 
NMDA perfusion and returned to silence after ketamine application. 

Note that NMDA markedly enhances both EPSPs (green shading) and 
burst firing. h, Summary of effects of NMDA perfusion and ketamine on 
bursts. Data are mean +s.e.m., *P< 0.05, **P< 0.01, ***P< 0.001, 

*** P< (0.0001, NS, not significant. See Supplementary Table 1 for 
statistical analysis and n numbers. 


On the basis of the observation that a hyperpolarization ramp 
current could induce burst firing in the LHb (Extended Data Fig. 6a), 
we devised a protocol involving a transient (100-ms) hyperpolarization 
current injection, which induced NMDAR- and T-VSCC-dependent 
rebound bursts in LHb brain slices with 100% success (Extended 
Data Fig. 6g-k). We therefore used an inhibitory opsin, eNpPHR3.0 
(an enhanced variant of halorhodopsin), to drive rebound bursts in 
the LHb (Fig. 5a). Yellow light pulses (1 Hz, 100 ms at 589 nm) reli- 
ably elicited robust rebound bursts in in vitro slice recordings with a 
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Figure 4 | Antagonists of T-VSCCs block LHb bursts and cause rapid 
antidepression. a, b, Example traces (left) and statistics (right, sampled 
within 3 min before and 1 min after drug application) showing effects 
of mibefradil (a) or ZD7288 (b) on spontaneous bursts in the LHb of 
wild-type and cLH rats. c-g, Antidepressant effects caused by systemic 
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ethosuximide injection (200 mg kg', i.p.; ¢, d) or local bilateral infusion 
of mibefradil into the LHb (e-g) in the FST (c, f) and SPT (d, g) 1h after 
treatment. Infusion sites verified by CTB (e). Data are mean + s.e.m., 

*P <0.05, **P <0.01. See Supplementary Table 1 for statistical analysis 
and m numbers. 
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Figure 5 | eNpHR3.0-induced rebound bursting drives behavioural 
aversion and depression-like symptoms that are reversible by ketamine. 
a, Construct of AAV2/9-eNpHR3.0 (top), example site of viral injection 
and optic fibre implantation (middle), and illustration of optetrode 
recording (bottom). b, c, Representative traces showing rebound bursts 
reliably elicited by pulsed yellow light in LHb brain slices in vitro (b) 

and in vivo (c) from mice infected with AAV2/9-eNpHR3.0. Spikes in 
bursting and tonic-firing mode are shown in blue and black, respectively. 
Percentage of successfully induced bursts is shown at bottom right. 

d, Raster plots (top) and post-stimulus time histogram (bottom) of an 
example LHb neuron responding to 100-ms yellow light stimulation from 


high intra-burst frequency and 90% success rate (Fig. 5b). Optetrode 
recording in vivo revealed that the intra-burst frequency and intra- 
burst number of spikes produced by this rebound burst protocol were 
comparable to those detected in depressed CRS mice (Fig. 5c-e). 

We then tested whether rebound bursts in the LHb could acutely 
drive aversion and depression-like symptoms in freely behaving 
mice. In the real-time place aversion (RTPA) assay, 1 Hz yellow light 
photostimulation significantly reduced the time spent in the light- 
paired chamber in mice injected with an adenoviral vector encoding 
eNpHR3.0 but not in those injected with a vector encoding eGFP 
(P= 0.0064, Fig. 5f), without changing velocity (Extended Data 
Fig. 10j). Furthermore, 1 Hz yellow light photostimulation of mice 
expressing eNpHR3.0 significantly increased immobility in the FST 
(P=0.0049, Fig. 5g) without altering locomotion in the OFT (Extended 
Data Fig. 10k), and decreased sucrose preference (Fig. 5h). A stimu- 
lation protocol (5 Hz on AAV-oChIEF injected mice) that yields the 
same overall firing rate as the rebound burst protocol (Extended Data 
Fig. 10e) did not cause depression-like phenotypes (Extended Data 
Fig. 10f-h), indicating that it is the pattern of burst firing rather 
than a mere increase in spikes that is important for the induction of 
depression-like behaviours. Notably, ketamine reversed both rebound 
burst-induced RTPA and depression-like behaviours (Fig. 5i-k). 


in vivo optetrode recording. e, Distribution of intra-burst frequencies 
and intra-burst spike numbers of eNpHR3.0-driven rebound bursts 

(left) are comparable to those recorded from CRS mice (right). Means 
are represented by black crosses. f, i, RTPA (f) induced by eNpHR3.0- 
driven bursts reversed by ketamine (i.p. injection, 1h before test, i). Left, 
representative heat maps of RTPA; right, quantitative aversion score (see 
Methods). g, h, j, k, Depression-like behaviours in the FST (g) and SPT (h) 
induced by eNpHR3.0-driven bursts reversed by ketamine (i.p. injection, 
1h before test, j, k). Data are represented as mean +s.e.m., **P <0.01, 
*** P< ).001. See Supplementary Table 1 for statistical analysis and n 
numbers. 


Discussion 

We have provided multiple lines of evidence to support the idea that 
a depression-like state depends critically on a bursting mode of firing 
in the LHb, and that the rapid antidepressant effects of ketamine are 
induced largely by targeting this burst firing. Burst firing can reduce 
synaptic transmission failure, enhance the signal-to-noise ratio, facil- 
itate synaptic plasticity or promote neuropeptide release**. Thus, 
enhanced bursting in the LHb may provide a stronger output than 
tonic firing onto the downstream inhibitory RMT¢ or onto inhibitory 
interneurons within the VTA or DRN, to suppress the activity of dopa- 
minergic, serotinergic or glutamatergic neurons in the reward centres 
of the brain. By blocking NMDAR-dependent LHb bursts, ketamine 
can then release this inhibition brake onto the reward centres to elicit its 
rapid antidepressant effects. Although our data addresses how ketamine 
can act so rapidly, the mechanism for the long-term effects of ketamine 
remains unclear; it might involve upregulation of brain-derived neuro- 
trophic factor (BDNF)? or synaptogenesis‘. 

Given the importance of LHb bursting activity for the depressive 
state, it is plausible that entry into this mode should be carefully 
controlled. We found no significant changes in T-VSCC currents in 
the LHb of animal models of depression (Extended Data Fig. 7c, d). 
Instead, the RMPs of LHb neurons in these models were more 
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hyperpolarized (by 5-6 mV) than those of control animals (Extended 
Data Fig. 7e, f). Considering that hyperpolarization can convert silent 
or tonic-firing neurons to bursting mode (Extended Data Fig. 6e), 
we hypothesize that a molecular mechanism that lowers the RMPs of 
LHb neurons to the level at which T-VSCCs can be de-inactivated may 
account for the increase of LHb bursting in depression-like states. In 
the accompanying paper“, we show that increased levels of astrocytic 
Kir4.1 and potassium buffering may be responsible for the hyperpo- 
larization of RMPs and for the increased bursting of LHb neurons in 
depression. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Animals. Male cLH rats (8-16 weeks of age) and age-matched male Sprague 
Dawley rats (SLAC Laboratory Animal Co.) were used. The cLH rats were screened 
using the learned helpless test!”*° for breeding as previously described*>. Male 
adult (8-16 weeks of age) C57BL/6 mice (SLAC) were used for establishing the 
chronic restraint stress (CRS) depression model. Rats were group-housed two per 
cage and mice were housed four per cage under a 12-h light-dark cycle (light on 
from 7 a.m. to 7 p.m.) with free access to food and water ad libitum. All animal 
studies and experimental procedures were approved by the Animal Care and Use 
Committee of the animal facility at Zhejiang University. 

Systemic drug delivery for antidepressant. All drugs were dissolved in 0.9% 
saline, and administered intraperitoneally (i.p.). Concentrations were as follows: 
ketamine (Gutian Pharma Co.) 25 mg/kg for rats*° and 10 mg/kg for mice®; etho- 
suximide (2-ethyl-2-methylsuccinimide: a T- VSCC blocker, which can cross the 
blood-brain barrier*”, Selleck) 200 mg/kg for mice. One hour after drug delivery, 
animals were used for behavioural or in vivo electrophysiology studies, or killed 
for in vitro electrophysiology studies. 

Cannula infusion experiment. A 26-gauge double guide cannulae (centre-to-centre 
distance 1.4mm, Plastics One) was placed at a 2° angle to the coronal plane 
(without the 2° rotation, we found it difficult to hit both sides of the LHb) and 
inserted bilaterally into the LHb (AP, —3.7mm from bregma; ML, + 0.7 mm; DV, 
—4,05mm from the brain surface) of cLH rats. A 33-gauge double dummy cannulae 
(Plastic One), secured with a dust cap, was inserted into the guide cannula to 
prevent clogging during the recovery period. After rats had recovered for at least 
7 days, drugs were microinjected with a 33-gauge double injector cannulae, which 
has a 0.6-mm extension beyond the tip of the guide cannula, while cLH rats were 
anaesthetized with isoflurane on an anaesthetic machine. The extensions were 
manually sharpened before insertion. 

Ketamine (25 j1g/ul, [C59 = 5.35 1M"), APS (40 nmol/l, [C59 = 30 uM’), NBQX 

(1 or 5nmol/jl, [C59 = 0.15 1M”°) or mibefradil (10 mol/l, C59 = 2.7 .M*!) were 
dissolved in 0.9% saline. Before the local drug infusion, tip-sharpened 33-gauge 
double injector cannulae were inserted into the guide cannulae to ensure clear 
passage and then pulled out. One microlitre of drug was infused (0.1 l/min) into 
each side through another set of tip-sharpened 33-gauge double injector cannulae, 
which were connected to the microsyringe. The injector cannulae were left in place 
for an additional 10 min to minimize spread of the drug along the injection track. 
The FST or SPT was performed 1h after the injection of ketamine or mibefradil 
and 0.5h after the injection of AP5 or NBQX. To verify the drug infusion sites, 
rats were injected with 1 jl CTB-488 to each side of the LHb after all behavioural 
tests. For immunostaining, rats were then euthanized 30 min after CTB injection 
and processed as described”. Brain slices were counterstained with Hoechst before 
mounting on the slides. Fluorescent image acquisition was performed with an 
Olympus VS120 virtual microscopy slide scanning system. Only data from rats 
with correctly sited injections were used. 
LC-MS/MS measurement of ketamine concentration. Male cLH rats were 
anaesthetized by isoflurane and subsequently decapitated 1h after drug admin- 
istration (i.p. or LHb cannulae infusion). Habenular or ventral hippocampal tis- 
sues (about 4 mg) were immediately dissected from the brain with Brain Matrix 
(RWD). The tissues were rinsed with PBS and immediately frozen in liquid nitro- 
gen and stored at —80°C until analysis. After samples were thawed, 100 1] solution 
(ddH20: acetonitrile = 9:1, v/v) and steel balls were added to homogenize samples 
for 2 min at 60 Hz with a tissue grinder. Ten microlitres of fluvoxamine solution 
(0.5 j1g/ml) was then added for vortex mixing, followed by the addition of 100 jl 
supersaturated sodium carbonate. After vortex mixing for 10s, the samples were 
extracted by liquid-liquid extraction with 80011 methyl tert-butyl ether under 
continuous shaking for 2 min at room temperature. After centrifugation for 10 min 
at 13,000 r.p.m. at 4°C, 64011 of the upper layer was removed into a centrifu- 
gal concentrator (CentriVap, LABCONCO) to spin at 1,725 r.p.m. and 30°C for 
45 min, then vortexed with 401 solution (ddH20: acetonitrile = 1:1, v/v). After 
centrifugation for 5 min at 13,000 r.p.m. and 4°C, the upper layer was injected into 
the chromatographic system. 

The concentrations of (R,S)-ketamine in habenular and hippocampal 
tissue were determined by achiral LC-MS/MS following a previously described 
method* with slight modifications. The analysis was accomplished using an 
Agilent Extend-C18 column (3.0mm x 100mm ID, 1.8j1m; Varian). The mobile 
phase consisted of 0.1% methanoic acid buffer (pH 2.67) as component A and 
acetonitrile as component B at a flow rate of 0.3 ml/min, temporized at 10°C 
(injection volume: 7 11). A linear gradient was run as follows: 0-0.2 min, 10% B; 
0.2-1.5 min, from 10% B increased to 90% B; 1.5-3.0 min, 90% B; 3.0-3.1 min, from 
90% B decreased to 10% B; 3.1-4.0 min, 10% B. Ten microlitres of (R,S)-ketamine 
of different concentrations (0.1, 1, 5, 10, 50, 100, 200 ng/ml) was mixed with 90 tl 
brain tissue to establish the standard calibration curve. The quantification of 
(R,S)-ketamine was accomplished by calculating area ratios using fluvoxamine 
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(10,11 of 0.5 1g/ml solution loaded) as the internal standard. The MS/MS analy- 
sis was performed using a triple quadruple mass spectrometer model AB SCIEX 
4000 plus (AB Company). The data were acquired and analysed using MutiQuant 
3.0.2. Positive electrospray ionization data were acquired using multiple reaction 
monitoring (MRM) using the following transitions for (R,S)-ketamine studies: 
238 — 125. 

Viral vectors. AAV2/9-CaMKIla-eNpHR3.0-eYFP (titre: 7.45 x 10! vector 
genome (v.g.)/ml, dilution: 1:5, 0.111 unilateral into LHb, Taitool Bioscience), 
AAV2/9-Ubi-eGFP (titre: 2.5 x 10! v.g./ml, 1:30, 0.111 unilateral into LHb, 
University of Massachusetts, Guangping Gao Laboratory), and AAV2/9-hSyn- 
oChIEF-tdTomato (titre: 6.29 x 10!” v.g./ml, 1:5, 0.141 unilateral into LHb, Obio 
Technology) were aliquoted and stored at —80°C until use. 

Behavioural assays. All behavioural assays were performed on animals 12-16 
weeks old. Most behaviours were performed during the light phase except for the 
sucrose preference test, which was performed during the dark phase to maximize 
the consumption of solution, and the open field test for CRS mice. Behavioural 
analysis was performed blinded to experimental conditions. 

Chronic restraint stress (CRS). Mice were subjected to chronic-restraint stress 
by placement in 50-ml conical tubes with holes for air flow for 2-3h per day for 
14 consecutive days’. 

Open field test (OFT). Animals were placed in the centre of an arena 
(40cm x 40cm x 40.5cm for mice and 100cm x 100cm x 50cm for rats) ina room 
with dim light for 10 min. A video camera positioned directly above the arena was 
used to track the movement of each animal (Any-maze, Stoelting). For optogenetic 
manipulations, mice were allowed to move freely throughout the arena for 9 min, 
with laser stimulation occurring during the middle 3-min epoch. 

Forced swim test (FST). Animals were individually placed in a cylinder (12cm 
diameter, 25 cm height for mice; 20cm diameter, 50 cm height for rats) of water 
(23-25°C) and swam for 6 min under normal light’. Water depth was set to pre- 
vent animals from touching the bottom with their tails or hind limbs. Animal 
behaviours were videotaped from the side. The immobile time during the last 
4-min test was counted offline by an observer blinded to animal treatment. 
Immobile time was defined as time when animals remained floating or motion- 
less with only movements necessary for keeping balance in the water. For rats, an 
additional pre-test was conducted 24h before the test, during which rats were indi- 
vidually placed in a cylinder of water with conditions described above for 15 min. 
For optogenetic manipulations, laser stimulation was delivered immediately after 
mice were placed in the water and lasted for 6 min. In order to minimize the impact 
of the optogenetic cable on swimming behaviour, the cable length was adjusted to 
allow the cable to just touch the water surface. 

Sucrose preference test (SPT). Animals were single housed and habituated with 
two bottles of water for 2 days, followed by two bottles of 2% sucrose for 2 days. 
Animals were then water deprived for 24h and then exposed to one bottle of 2% 
sucrose and one bottle of water for 2h in the dark phase. Bottle positions were 
switched after 1h (for 2h test). Total consumption of each fluid was measured and 
sucrose preference was defined as the average sucrose consumption ratio during 
the first and second hours. Sucrose consumption ratio was calculated by divid- 
ing the total consumption of sucrose by the total consumption of both water and 
sucrose. For optogenetic manipulations, mice were gently placed in a white arena 
containing normal bedding and allowed to move freely in the arena. During the 
90-min test, light was delivered during 30-60 min. Sucrose preference scores were 
measured for every 30 min. Only animals that had a baseline sucrose preference 
greater than 30% during the first 30-min session proceeded to the next session. 
Otherwise they were tested later on a different day”. 

Real-time place aversion (RTPA). Mice were placed in a white open chamber 
(52cm x 26cm x 23cm) consisting of two chambers, and allowed to freely move 
between chambers for 20 min to assess their baseline place preference*®. During the 
following 20-min test, we assigned a stimulated side in a counterbalanced manner. 
Laser stimulation was delivered as soon as mice entered the stimulated side and 
terminated once mice crossed to the non-stimulated side. A video camera posi- 
tioned above the chamber recorded each trial. Mouse locations and velocity were 
tracked and analysed using Any-maze software (Stoelting). Avoidance score was 
calculated as (Time in stimulated side — Time in non-stimulated side) x, — (Time 
in stimulated side — Time in non-stimulated side)paseline- 

Optogenetic light delivery and protocols. For mice expressing eNpHR3.0% and 
eGFP, a 589-nm yellow light laser was delivered at 1 Hz, 100 ms pulse. Light inten- 
sity was calculated to be 16 mW, which was estimated at 3.5 mm from optic patch 
cable tip with a ceramic sleeve. For the SPT experiment, laser intensity during 
eNpHR3.0 stimulation was adjusted to 10 mW because LHb neurons could not 
follow 16 mW stimulation for longer than 10 min (amplitude of bursting spikes 
declines to noise level). For mice expressing oChIEF”’ and eGFP, a 473-nm blue 
light laser was used; light intensity was calculated to be 25 mW. The blue light 
delivery paradigm was either 5 ms pulse, 5 pulses per s for the pulsed 100-Hz 
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protocol, or 20 Hz, tonic 5-ms pulses for the 20-Hz protocol, or 5 Hz, tonic 5-ms 
pulses for the 5-Hz protocol. 

LHb brain slice preparation. Animals (postnatal day (P)45-70 rats and P65-75 
mice) were anaesthetized with isoflurane and 10% chloral hydrate, and then per- 
fused with 20 ml ice-cold ACSF (oxygenated with 95% O + 5% CO2) containing 
(mM): 125 NaCl, 2.5 KCI, 25 NaHCOs, 1.25 NaH2POu,, 1 MgCl, and 25 glucose, 
with 1mM pyruvate added. The brain was removed as quickly as possible after 
decapitation and put into chilled and oxygenated ACSF. Coronal (for most of 
experiments if not specified) or sagittal slices containing the habenula (350,1m 
and 300\1m thickness for rats and mice, respectively) were sectioned in cold ACSF 
using a Leica2000 vibratome and then transferred to ASCF at 32°C for incubation 
and recovery. ACSF was continuously gassed with 95% O2 and 5% COs. Slices 
were allowed to recover for at least 1 h before recording. For cLH rats, as a very 
high percentage (90%) of cLH offspring are learned helpless, we did not perform 
a learned helplessness test before taking them for brain slice recording. For CRS 
mice, both CRS and wild-type control mice went through an FST before brain slice 
recording. We then used the CRS animals that showed high immobility scores 
(immobile time >140s) and control mice that showed low immobility (immobile 
time <110s) in the FST for slice recording. 

In vitro electrophysiological recordings. For LHb neuron recordings, currents 
were measured under whole-cell patch-clamp using pipettes with a typical resist- 
ance of 5-6 MQ? filled with internal solution containing (mM) 105 K-gluconate, 
30 KCI, 4 Mg-ATP, 0.3 Na-GTP, 0.3 EGTA, 10 HEPES and 10 Na-phosphocreatine, 
with pH set to 7.35. The external ACSF solution contained (in mM) 125 NaCl, 
2.5 KCI, 25 NaHCOs, 1.25 NaH2POx, 1 MgCly and 25 glucose. Cells were 
visualized with infrared optics on an upright microscope (BX51WI, Olympus). 
A MultiClamp 700B amplifier and pCLAMP10 software were used for electro- 
physiology (Axon Instruments). The series resistance and capacitance were 
compensated automatically after a stable Gigaseal was formed. Recordings were 
typically performed between 3 and 6 min after break-in. The spontaneous neuronal 
activity was recorded under current clamp (I=0 pA). 

LHb neurons show three modes of spontaneous activity at resting conditions. 
Silent cells showed no spike activity during recording. Tonic cells spontaneously 
generated tonic trains of action potentials at frequencies between 0.1 and 10 Hz, 
rarely up to 10-20 Hz. Burst-firing cells spontaneously generated clusters of spikes 
with an initially high but progressively declining intra-burst firing frequency in 
each burst. For bursting neurons, 99% (from n= 50 bursting neurons) of their 
spikes occurred within bursts. The inter-spike intervals (ISI) of neurons represent 
the duration of the silent periods between two neighbouring single spikes. A typical 
bursting neuron shows a bimodal distribution of ISIs since it is composed of 
relatively large inter-burst intervals and small intra-burst intervals. By contrast, 
tonic-firing neurons show a more homogenous Poisson distribution of ISIs. 

Evoked NMDAR EPSCs were recorded under voltage clamp from —50mV 
to —80 mV in sagittal LHb slices by stimulating the input stria medullaris fibre 
in a modified extracellular ACSF solution*® with the AMPAR blocker NBQX, 
GABAR blocker picrotoxin and 0 Mg’ and isolated currents were confirmed to 
be NMDAR currents because they could be abolished by AP5 (Extended Data 
Fig. 5a, b). Evoked T-type VSCC currents were recorded under voltage clamp 
starting from a holding potential of -50 mV before being increased to conditioning 
potential (—100 mV) for 1s preceding the command steps (5 mV, 0.1 Hz per step 
increment). LHb T-VSCC currents were obtained by subtraction of recorded traces 
in the presence or absence of mibefradil*? (Extended Data Fig. 7a-d). 
Pharmacological characterization of burst and neuronal RMP. Ketamine (100|1M, 
10mM and 14M, Gutian Pharma Co.)°, AP5 (1004.M, Sigma)”°, NBQX (10,.M, 
Sigma)?!, mibefradil (101M, Sigma)°”, ZD7288 (501M, Tocris), TTX (1M, 
Sigma)’, picrotoxin (100,.M, Sigma)”°, NMDA (20M, Sigma)™, AMPA (14M, 
Sigma)** and fluoxetine (30 |.M, Sigma)*>** were used for electrophysiology. Drugs 
were dissolved in water, aliquoted at 1,000 final concentration and stocked at 
—80°C before use. Recordings were performed between 3 and 6 min after whole 
cell break-in. During recordings, baselines of spontaneous or rebound bursts and 
RMPs were recorded for at least 3 min before drugs were perfused. The start of drug 
perfusion was precisely indicated by a bubble that was added before the transition 
from normal ACSF to ACSF with drug. 

The effect of ketamine (100 |1M) on mEPSCs was tested under voltage clamp 
(the cells were held at —60 mV) in the presence of TTX (11M) and picrotoxin 
(100 1M) in ACSF. The intracellular solution contained CsMeSO; 115 mM, 
CsCl 20 mM, HEPES 10 mM, MgCl, 2.5 mM, Na2-ATP 4mM, Na-GTP 0.4mM, 
Na-phosphocreatine 10 mM, and EGTA 0.6 mM. Data were filtered at 2 kHz 
and sampled at 10 kHz using Digidatal1550. The baseline was recorded for 
5 min. Ketamine was then perfused for 10 min. mEPSCs were analysed from 
2 to 4min before and 5 to 7 min after the start of ketamine perfusion. Data 
were analysed by Mini Analysis Program (Synaptosoft) with an amplitude 
threshold of 5 pA. 


In vivo electrophysiology. For in vivo recording experiments, a custom-made 
microdrive array consisting of eight tetrodes (impedance 250-500 KQ, California 
fine wire)°’ was implanted into the LHb (AP, —1.72 mm; ML, 0.46 mm; DV, 
—2.44mm from the brain surface) of control or CRS mice. Stainless steel wires were 
attached to two screws on the skull as ground. The microdrive was secured to the 
skull with dental cement. After recovery for 2 weeks, mice were allowed to adapt 
to the recording headstage for 10 min per day for 2-3 days. Spontaneous spiking 
activity (digitized at 40 kHz, band-pass filtered between 300 and 6,000 Hz) and LFP 
(digitized at 1 kHz sampling rate, low-pass filtered up to 250 Hz) were recorded 
simultaneously for 30 min during the still period of the mice in their home cages 
with a gain of 5,000. A single channel without a detectable unit was assigned 
as a reference electrode. The tetrodes were lowered in steps of 70j1m after each 
recording session, followed by at least a 2-day recovery. For CRS mice, data were 
recorded for 30 min before and 1h after ketamine treatment (10 mg/kg, i.p.). If mice 
received a second ketamine injection, at least a 2-week interval was introduced 
before the next recording session. All procedures were performed during the light 
phase. The positions of the electrodes were verified by electrolytic lesions (301A, 
10-15s) at the end of all experiments. 

Spike sorting. All waveforms recorded from each tetrode were imported in Offline 
Sorter V3 (Plexon Inc.). Single units were manually identified by threshold crossing 
and principal component analysis (PCA). Spikes with inter-spike interval (ISI) less 
than the refractory period (1.4ms) were excluded. Cross correlograms were plotted 
to ensure that no cell was discriminated more than once on overlapping tetrodes. 
Only units with a signal-to-noise ratio larger than 2 were used. 

Data analysis. Data analysis was conducted by Neuroexplorer4 (Plexon Inc.) and 
MATLAB. For criteria to identify in vivo bursts in the LHb, we analysed intra-spike 
intervals from 120 spontaneous bursts (from 10 neurons) recorded in LHb brain 
slices (Extended Data Fig. 4c). All bursts began with a maximal inter-spike interval 
of 20 ms and ended with a maximal inter-spike interval of 100 ms (Extended Data 
Fig. 4c). Accordingly, we defined in vivo bursting as clusters of spikes beginning 
with a maximal inter-spike interval of 20 ms and ending with a maximal inter-spike 
interval of 100 ms (Extended Data Fig. 4d). The minimum intra-burst interval was 
set at 100 ms (Extended Data Fig. 4d) and the minimum number of spikes in a burst 
was set at 2. Burst events per minute, the percentage of spike firing within bursts, 
intra and inter-burst intervals and average spike numbers in a burst were analysed. 

The raw LFPs were filtered between 1 and 200 Hz using a fourth-order 
Butterworth filter. To measure the synchronization between spikes and local field 
potential oscillations, we calculated spike-triggered averages (STAs)**°? by aver- 
aging all LFP segments at +300 ms around all spikes under one experimental 
condition, and normalized STAs by spike number. Only STAs of spikes and LFPs 
recorded from separate electrodes were used. We then quantified STAs using spike- 
field coherence (SFC)°**”. The spectrum of the STA (fSTA) was calculated using 
the multitaper method (Chronux 2.0) with seven tapers. The average of these indi- 
vidual frequency spectra results in the spike triggered power (STP). The SFC was 
calculated as SFC (f) = [fSTA(f)/STP(f)]. SFC in the theta band was calculated as 
the average SFC in the theta frequency domain (4-10 Hz). 

In vivo optetrode recordings. To validate the effects of optogenetic stimulation on 
neuron firing, optetrode recording was applied oton mice injected with AAV2/9- 
CaMKIla-eNpHR3.0-eYFP into the LHb. Mice were anaesthetized with isoflu- 
rane (5% for induction and 1% for maintenance). An optetrode consisted of four 
tetrodes and an optical fibre in the centre was lowered into the LHb (AP, —1.72mm; 
ML, 0.46 mm; DV, —2.62 mm from the brain surface). The tip of the optical fibre 
was 200 1m above the tip of the tetrodes. For optical stimulation, the optical 
fibre was connected to a 589 nm laser. 100 ms light pulses at 1 Hz were delivered 
to evoke rebound bursting. The light intensity was identical to that used in 
behavioural tests. Single units were recorded and sorted as described above. 
Surgery for optogenetics. Mice (P50-80) were deeply anaesthetized and placed 
in a stereotactic frame (RWD Instruments). Virus (0.1 11) was injected into the 
LHb one one side (AP, —1.72 mm from bregma; ML, + 0.46 mm; DV, —2.62 mm 
from the brain surface) using a pulled glass capillary with a pressure microinjector 
(Picospritzer III, Parker) at a rate of 0.1 ,1l/min. The injection needle was with- 
drawn 10 min after the end of the injection. After surgery, mice recovered from 
anaesthesia on a heat pad. 

For optic fibre implantation, a 200-j1m fibre-optic cannula (Plastics one) was 
placed 300 1m above the centre of viral injection site, and cemented onto the skull 
using dental cement. 

Biophysics modelling. Our single compartment model of a LHb neuron was con- 
structed using the following equations: 


dV /dt = (Tiny — Fr — Ik — INa— Tteak — INateak — Iampa — InmpA-Icaza)/€ 


where V is the membrane potential and c is the capacitance of the membrane. Only 
currents absolutely necessary were included in the model. Jin; is the external current 
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applied to the cell; Iz is the low-threshold Ca?* current mediated by T-VSCCs; Ina 
and Ix are the fast Na* and K* currents of action potentials; Ixieak and INateak are 
responsible for the resting membrane potential; [ampa; Inmpa and Igaga are the 
synaptic currents mediated by AMPA, NMDA and GABA receptors respectively. 
See Supplementary Information for detailed formulae. 

Statistical analysis. Required sample sizes were estimated on the basis of our past 
experience performing similar experiments. Mice were randomly assigned to treat- 
ment groups. Analyses were performed in a manner blinded to treatment assign- 
ments in all behavioural experiments. Statistical analyses were performed using 
GraphPad Prism software v6. By pre-established criteria, values were excluded 
from analyses if the viral injection or drug delivery sites were out of the LHb. 
All statistical tests were two-tailed, and significance was assigned at P< 0.05. 
Normality and equal variances between group samples were assessed using the 
D Agostino & Pearson omnibus normality test and Brown-Forsythe tests, respec- 
tively. When normality and equal variance between sample groups was achieved, 
one-way ANOVA (followed by Bonferroni’s multiple comparisons test), or t-test 
were used. Where normality or equal variance of samples failed, Kruskal-Wallis 
one-way ANOVA (followed by Dunn's correction), Mann-Whitney U test, or 
Wilcoxon matched-pairs signed rank test were performed. Linear regression test, 
Chi-square test, Fisher’s exact test or two-way ANOVA (followed by Bonferroni's 
multiple comparisons test) were used in appropriate situations. The sample sizes, 
specific statistical tests used, and main effects of our statistical analyses for each 
experiment are reported in Supplementary Information Table 1. 

Data availability. The data that support the findings of this study are available 
from the corresponding author upon reasonable request. 
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Extended Data Figure 1 | Representative ion chromatograms of 
separation of ketamine by LC-MS/MS and OFT results for different 
drug treatments. a, Antidepressant response to systemic ketamine 
injection (25 mg kg”, ip.) in FST Lh after treatment in cLH rats. n = 10, 
8, 6 for wild-type rats injected with saline, cLH rats injected with saline, 
and cLH rats injected with ketamine, respectively. b, c, OFT of cLH rats 
after bilateral infusion of ketamine (25 1g, 1 pl each side) or mibefradil 
(10 nmol, 1 pl each side; b), AP5 (40 nmol, 1 tl each side) or NBQX 

(1 nmol, 1 jl each side; c) into the LHb. n =7, 10, 6 rats (b) and 8, 9, 9 
rats (c) for saline, ketamine and mibefradil, respectively. d, OFT of CRS 


mice Lh after intraperitoneal injection of saline or ethosuximide. n= 6, 

9 mice for saline and ethosuximide, respectively. e-g, Representative 

ion chromatograms of separation of ketamine including double blank 

(e), brain calibrator spiked with 100 ng ml"! ketamine (KET) (f) and 

a habenular sample from a cLH rat bilaterally infused with 25 gpl! 
ketamine into the LHb 1h earlier (g). Fluvoxamine (FFSM) is used as the 
internal standard. Data are mean £s.e.m.; *** P< 0.001, **** P< 0.0001; 
n.s., not significant. One-way ANOVA (a-c), two-tailed Mann-Whitney 
test and unpaired t-test (d). 
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Extended Data Figure 2 | I-V relationship, input resistance and burst 
duration are not changed in animal models of depression. a—d, I-V 
plots (a), input resistance (b), intra-burst frequency (c) and inter-burst 
frequency (d) for LHb neurons in brain slices from mice and rats. Note 
that intra-burst frequency (c) but not inter-burst frequency (d) reversely 
correlates with RMPs. n = 20 neurons per group, 5 wild-type and 3 cLH 
rats (b); 7 = 53 neurons, 8 mice and 12 rats (c, d). e-i, Burst duration in 
animal models of depression recorded in brain slices in vitro (e-g) and in 
behaving animals in vivo (h, i). e, Representative trace of a typical burst 
in an in vitro recording. It consists of a depolarizing wave and a high- 
frequency train of action potentials. The duration of the line indicated by 
the black arrow at half-maximum amplitude of the area under the red dash 
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line is defined as the half width of burst duration; the duration between the 
first and last shoots (intra burst spikes) within one burst is defined as the 
shoot duration. f, g, Half widths of burst duration (f) and shoot duration 
(g) do not differ between cLH and wild-type rats. n = 10, 20 neurons, 7 
wild-type and 5 cLH rats. h, Representative trace of an LHb neuron (pink 
shades indicate burst events) recorded in vivo. An enlarged view of a 
typical burst on the right shows the definition of burst duration, which is 
the time interval between the first and last spike within the same burst. 

i, Burst duration of LHb neurons from in vivo recording do not differ 
between control and CRS mice. n = 35, 33 neurons, 5 control and 5 CRS 
mice. Data are mean +s.e.m.; n.s., not significant. Two-tailed unpaired 
t-test (b, f, g) and Mann-Whitney test (i). 
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Extended Data Figure 3 | Chronic restraint stress induces reliable 
depression-like phenotypes and increased burst firing, which can 

be reversed by ketamine. a, b, CRS induces increased immobility and 
decreased latency to immobility in the FST (a) and decreases sucrose 
preference in the SPT (b). n= 8, 14 mice (a) and 7, 8 mice (b) for control 
and CRS group, respectively. c, CRS increases locomotion in the OFT. 
n=8 mice per group. d, e, Ketamine suppresses immobility of CRS 

mice in the FST (d) and increases sucrose preference (e) in CRS mice. 

f, Ketamine decreases locomotion of CRS mice in the OFT. n= 8, 12 mice 
(d, f) and 14, 21 mice (e) for saline and ketamine groups, respectively. 
g-j, Burst firing is significantly increased in CRS mice, and this increase 


is reversed by ketamine (i.p., 10 mg kg™'). g, Whole-cell patch-clamp 
recording sites across different subregions of LHb in mice. h, Pie charts 
illustrating the per cent abundance of the three types of LHb neurons. 

i, Bar graph illustrating the percentage of burst- and tonic-type spikes in 
all spikes recorded. j, Histogram of distribution of inter-spike intervals 
(ISIs). 1 = 63, 69, 57 neurons, 4 control, saline-treated mice, 3 CRS, saline- 
treated mice and 5 CRS, ketamine-treated mice. Data are mean +s.e.m.; 
*P<0.05, **P< 0.01, ****P < 0.0001, n.s., not significant. Two-tailed 
Mann-Whitney test and unpaired t-test (a— f), Chi-square test (h), Fisher’s 
exact test (i). 
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Extended Data Figure 4 | Ketamine suppresses LHb bursting activity 
in vivo. a, Histology of recording site. Arrowhead indicates the electrical 
lesion by tetrodes. b, Representative waveform clusters of two isolated 
LHb units in the respective four channels of a tetrode (left) and principal- 
component analysis (PCA) clustering display of these two units (right). 

c, ISIs between consecutive spikes in relation to their positions within 

the burst (120 bursts from in vitro recording). d, Example recording trace 
(upper) and spike train (bottom) of an irregular-firing LHb neuron from 
an in vivo recording. Bursts (blue sticks) are identified by ISI method 

(see Methods for details): 1, ISI to start burst; 2, ISI to end burst; 3, 
inter-burst interval. e, Histogram of ISI distribution (bin, 2.5 ms) from 

in vivo recording. f-i, Mean of total and tonic firing rates (f, h), intra- and 
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inter-burst frequencies, and number of spikes per burst (g, i) of neurons 
recorded in vivo from control mice, CRS mice and CRS mice 1h before 
and after ketamine injection (i.p., 10 mg kg"). n= 35, 33 neurons (f, g) 
and 18, 18 neurons (h, i), 5 control and 5 CRS mice. j, STAs of neurons 
recorded in vivo from control mice, CRS mice, and CRS mice after 
ketamine injection (i-p., 10mg kg~'). Note that the distance between the 
neighbouring troughs is around 140 ms (corresponding to 7 Hz) in CRS 
mice. Data are mean ts.e.m.; *P < 0.05, **P< 0.01, ***P< 0.001, 

** P< 0.0001, n.s., not significant. Two-tailed Mann-Whitney test and 
unpaired f-test (f, g), two-tailed Wilcoxon matched-pairs signed rank test 
and paired t-test (h, i). 
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Extended Data Figure 5 | Drug effects on NMDAR currents, RMPs and g, ZD7288 causes a small but significant hyperpolarization of LHb neurons. 
miniature EPSCs of LHb neurons. a, Example traces showing evoked n=9 neurons, 4 rats. h, Example mEPSC traces before (black) and 
EPSCs when cells were held at -80 mV. NMDAR EPSCs were isolated by after (red) perfusion of ketamine (100,1M, see Methods) measured in a 
application of picrotoxin (100 1M) and NBQX (10,1M) in Mg**-free ACSE, _ whole-cell configuration (cells were held at -60 mV) from LHb neurons. 
and confirmed by AP5 (100,1M) blockade. b, Amplitudes of NUDAR i, j, Cumulative distribution of mEPSC amplitude (i) or mEPSC inter-events 
EPSCs under different voltages (EPSCs are recorded under 0 Mg*", interval and average frequency (j) of neurons before (black) or after (red) 
picrotoxin and NBQX). Note that isolated NMDAR EPSCs are completely ketamine treatment. Each line represents values before or after treatment 
blocked by AP5. c-f, Ketamine (c), AP5 (d), NBQX (e) and mibefradil with ketamine from the same LHb neuron. n= 9 neurons, 2 rats. Data are 
(f) do not affect RMPs of LHb neurons. n = 10 neurons, 5 rats (c); 7 = 17 mean + s.e.m.; *P < 0.05, n.s., not significant. Two-tailed paired t-test. 


neurons, 6 rats (d); 7 = 11 neurons, 6 rats (e); 7 = 10 neurons, 3 rats (f). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 
3 +150 pA Bb R?=0.38 © 
Ramp current | 59 pA P<0.0001 


-50 pA = = 3 ° 
> =" 564 ao 
< E = 
(0) — (0) Oo@o 
= [= pn 
5 2 a4 @GE® O 
= s + | @®@ 0 
p 3 39 
=| ~ a 
9 2 6 
g a a= 
= =o 
-80 -60 -40 -80 -60 -40 -80 -60 -40 
Burst initial voltage (mV) 
f 
20 pA 
—_ -20 pA OpA 
> 
€ 
fo} 
N 
500 ms 
-40 mV -60 mV -46 mV 
h road 
ACSF Ketamine (100 uM) | ACSF = AP5 (100 uM) 
z a 
oO 
a 
fe} 
| 8 
——~ br 
: <-— va " 
\ a | 
100 ms: \ 100 ms: \_| 
-100 pA one 
LT LT “LF “LST 


i 
ACSF Mibefradil (10 uM) 


© 6 
oO 
2 
2 g 
a 
100 ms: 400 ie 
“LT -100 pA “LI 
7 “LP-100 pAL 
K csr Fluoxetine (30 uM) a 
1.0 
208 
% 
8 0.6 
\. B04 
n 
\ 02 
100 ms: 0.0 fi 
“LT -100 pA “LI & RS 
& 
Extended Data Figure 6 | Voltage dependency of LHb bursts and burst-firing mode by hyperpolarization (e), and a spontaneously bursting 
pharmacological manipulation of hyperpolarization-triggered rebound —_ neuron transformed to tonic-firing mode by depolarization (f). 
bursts in LHb. a, Representative trace of an LHb neuron transformed g-k, Example traces (left) and statistics (right) showing effects of ketamine 
from bursting to tonic-firing mode with a ramp-like current injection, (g), AP5 (h), mibefradil (i), ZD7288 (j) and fluoxetine (k) on rebound 
showing bursting at more hyperpolarized potentials and tonic firing at bursts induced by a transient hyperpolarizing current step. Current 
more depolarized potentials. Spikes in bursting and tonic-firing mode injection steps are illustrated under the bottom of the trace. n =9 neurons, 
are shown in blue and black, respectively. b-d, Correlations of membrane 2 rats (g); m= 12 neurons, 5 rats (h); m = 14 neurons, 2 rats (i); n=8 
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e, f, Example traces of a spontaneously tonic-firing neuron transformed to 
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Extended Data Figure 7 | T-VSCC currents and RMPs of LHb neurons 
in animal models of depression. a, Voltage steps used to isolate T-VSCC 
currents, starting from a holding potential of —50 mV before being 
increased to conditioning potential (—100 mV) for 1s preceding the 
command steps (5 mV, 0.1 Hz per step increment). b, LHb T-VSCC 
currents (right column) are obtained by subtraction of recorded traces 
without (left) mibefradil from those with mibefradil (101M, middle). 
The maximum of isolated T-VSCC current is obtained at —50 mV. 
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c, d, No difference in LHb T-VSCC currents is detected between wild-type 
and cLH rats (c) or control and CRS mice (d). n =4 neurons per group 


from 2 wild-type and 2 cLH rats (c);n =5 neurons per group from 
2 control and 2 C57 mice (d). e, f, Scatter plots showing that neuronal 


RMPs 


are more hyperpolarized in cLH rats (e) and CRS mice (f) than in controls. 


n=45 neurons per group from 5 wild-type and 4 cLH rats (e); n=50 
neurons per group from 4 control and 3 C57 mice (f). Data are mean 4 
*** P< 0,001, ****P < 0.0001. Two-tailed unpaired t-test (e, f). 
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Extended Data Figure 8 | Simulation of LHb neurons incorporating 
T-VSCCs and NMDAR channels. a, Scheme of a single compartment 
model of an LHb neuron (see Methods). b, The contribution of NUDAR 
current Iympar and T-VSCC current I; during bursts derived from 
simulation. c, RMP-dependent firing mode of the LHb model neuron. 
Spikes in bursting mode are shown in blue. Spikes in tonic and silent 
firing mode are shown in black. d, The correlation between RMPs and 
intra-burst frequencies of the LHb model neuron. e, f, Example trace (left) 
and statistics (right) showing in silico effects of ketamine (set NUDAR 
conductance gxmpar=0, e) or mibefradil (gr-vscc = 0, f) on spontaneous 
bursts in the LHb model neuron. The bursting probability was evaluated 
across ten independent trials with simulated synaptic inputs. Note that 

in silico knockout of NMDAR or T-VSCC current from the model abolished 
the bursts, which matched experimental observations (Fig. 3a, d, 4a). 

g, h, Example trace (left) and statistics (right) showing in silico effects of 
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NBQX (gampar=0, g) or AMPA (gampar increased from 8 to 15 1S cm’, 
h) on spontaneous bursts in the LHb model neuron. n = 10 simulations 
(e-h). i, An example trace summarizing the ionic components and channel 
mechanisms involved in LHb bursting: hyperpolarization of neurons to 
membrane potentials negative to —55 mV slowly de-inactivates T-VSCC. 
Ir continues to grow as the de-inactivated T-VSCCs increase, leading to 
a transient Ca’* plateau potential. The Ca** plateau helps remove the 
magnesium blockade of NMDARs while T-VSCC inactivates rapidly 
during the depolarization. After the Ca?* plateau reaches approximately 
—45 mV, Inmpa dominates the driving force to further depolarize RMP 
to the threshold for Na spike generation. As RMP falls back to below 

—55 mV it de-inactivates Jy and results in the intrinsic propensity of LHb 
neurons to generate the next cycle of bursting. Data are mean + s.e.m.; 
#7 D < (1.0001. Two-tailed paired t-test. 
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Extended Data Figure 9 | AMPA or picrotoxin increases LHb burst 
frequency. a, b, Example traces (left) and statistics (right, sampled within 
3 min before and 1 min after drug application) showing effects of AMPA 
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Extended Data Figure 10 | Additional behavioural results of eNpHR3.0- 
or oChIEF-based photostimulation. a, Representative trace showing that 
LHb neurons can follow only the first of the five 100-Hz pulsed blue light 
stimulations in a pulsed 100-Hz protocol in LHb brain slices infected with 
AAV2/9-oChIEF. Percentage of responsive neurons shown on the right. 
b-d, Pulsed 100-Hz photostimulation of mice expressing oChIEF does 

not change locomotion in the OFT (b), and does not induce depressive 
phenotypes in the FST (c) or SPT (d). n=6, 7 mice (b, c) and 5, 5 mice (d) 
for oChIEF and eGFP groups, respectively. e, Representative trace showing 
LHb neurons following a 5-Hz tonic blue light stimulation protocol in 
LHb brain slices infected with AAV2/9- oChIEF. Percentage of responsive 
neurons shown on the right. f-h, 5-Hz photostimulation of LHb in mice 
expressing oChIEF does not change locomotion in the OFT (f) and does 
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not induce depressive phenotypes in the FST (g) or SPT (h). n=7, 6 

mice (f); 6, 6 mice (g) and 5, 12 mice (h) for oChIEF and eGFP groups, 
respectively. i, 20-Hz tonic photostimulation of the LHb in mice expressing 
oChIEF does not cause RTPA. Note that this result is different from 20-Hz 
optogenetic stimulation of a presynaptic input from the entopeduncular 
nucleus into the LHb, which caused aversion!!. n=5, 7 mice for oChIEF 
and eGFP group. j, k, eNpHR3.0-driven burst does not change speed of 
movement in RTPA (j) or affect locomotion in the OFT (k). n=11, 7 mice 
(j) and 13, 21 mice (k) for NpHR and eGFP group, respectively. Data are 
mean + s.e.m.; n.s., not significant. Two-way ANOVA (b, d, f, h, k), two- 
tailed unpaired t-test (c, g, i) and paired t-test (j). 
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Astroglial Kir4.1 in the lateral habenula 
drives neuronal bursts in depression 


Yihui Cui>?, Yan Yang!, Zheyi Ni!, Yiyan Dong!, Guohong Cai’, Alexandre Foncelle*, Shuangshuang Ma!, Kangning Sang!, 
Siyang Tang!, Yuezhou Li!, Ying Shen!, Hugues Berry**, Shengxi Wu? & Hailan Hu! 


Enhanced bursting activity of neurons in the lateral habenula (LHb) is essential in driving depression-like behaviours, 
but the cause of this increase has been unknown. Here, using a high-throughput quantitative proteomic screen, we 
show that an astroglial potassium channel (Kir4.1) is upregulated in the LHb in rat models of depression. Kir4.1 in the 
LHb shows a distinct pattern of expression on astrocytic membrane processes that wrap tightly around the neuronal 
soma. Electrophysiology and modelling data show that the level of Kir4.1 on astrocytes tightly regulates the degree of 
membrane hyperpolarization and the amount of bursting activity of LHb neurons. Astrocyte-specific gain and loss of 
Kir4.1 in the LHb bidirectionally regulates neuronal bursting and depression-like symptoms. Together, these results 
show that a glia-neuron interaction at the perisomatic space of LHb is involved in setting the neuronal firing mode in 
models of a major psychiatric disease. Kir4.1 in the LHb might have potential as a target for treating clinical depression. 


A major breakthrough in neuroscience was the discovery that astro- 
cytes interact intimately with neurons to support and regulate essential 
functions and thereby foster information processing in the brain! *. 
Many investigations have focused on astroglia—neuron interactions 
at tripartite synapses, where astrocyte processes wrap tightly around 
presynaptic and postsynaptic sites”. By contrast, less attention has been 
placed on astroglia—neuron interactions in proximity to the neuronal 
soma. In particular, how astrocytes regulate the intrinsic firing patterns 
of neurons, and what structural basis may underlie this regulation, have 
been much less explored. 

Despite the surging interest in the role of the LHb in depression 
only limited attention has been given to the influence of astrocyte 
malfunction in the LHb on the aetiology of depression’®. In an accom- 
panying paper, we show that the bursting activity of LHb neurons is 
greatly enhanced in animal models of depression”. LHb bursting drives 
depressive-like behaviours and is a prominent target of the rapid anti- 
depressant ketamine?°. However, the cause of this enhanced bursting 
by LHb neurons remains unknown. 


10-18 
> 


LHb Kir4.1 is upregulated in depression 

We used an unbiased, high-throughput, quantitative proteomic screen 
to compare habenular protein expression between congenitally learned 
helpless (cCLH)*' and wild-type Sprague-Dawley rats. Kir4.1 was highly 
upregulated in the habenulae of cLH rats'* (1.69-fold of wild-type, 
Extended Data Fig. 1a). Western blot analysis confirmed that Kir4.1 
was significantly increased (1.44-fold, P=0.009) in the membrane pro- 
tein fraction of habenulae from cLH rats (Fig. 1a). By contrast, another 
astrocyte-specific protein, glial fibrillary acidic protein (GFAP), showed 
no change in expression (Extended Data Fig. 1b), indicating that there 
was no astrogliosis. 

To test whether Kir4.1 upregulation is universal in depression, we 
examined rats in which depression was induced by treatment with 
lipopolysaccharide (LPS)””. One week of LPS injection (0.5 mg kg! 
per day, intraperitoneal (i.p.)) was sufficient to cause strong depressive- 
like phenotype in three-month-old Wistar rats in the forced swim test 
(FST; Extended Data Fig. 1c)?” and sucrose preference test (SPT). 


Rats with LPS-induced depression also showed a significant increase 
in Kir4.1 in the habenula (1.87-fold, P< 0.0001, Fig. 1b). Quantitative 
real-time PCR revealed an increase (1.2-fold) in Kir4.1 (also known 
as Kcnj10) mRNA level in habenulae from cLH rats (Extended Data 
Fig. 1d), suggesting that at least part of the change in protein level is 
due to transcriptional upregulation. 

Kir4.1 is a principal component of the glial Kir channel and is 
largely responsible for mediating the KT conductance and setting the 
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Figure 1 | Kir4.1 is upregulated in the LHb in rat models of depression. 
a, b, Western blot analysis showing upregulation of Kir4.1 protein in 
membrane fraction of habenulae of cLH rats (n=9, 9 rats for WT and 
cLH, respectively, a) and rats with LPS-induced depression (n =6, 6, b). 
Tubulin is used as loading control. WT, wild-type Sprague-Dawley rats. 

c, d, I-V plot and bar graph showing Ba?+-sensitive currents in cLH rats 
and wild-type controls at the age of P60-90 (c) and P30 (d). e, f, Age- 
dependent FST (e) and LHT (f) phenotypes of cLH rats. n = 8, 8 rats 

for P30, n= 10, 10; 6, 6 rats for P90 FST and LHT, respectively. Data are 
means +s.e.m. *P< 0.05, **P< 0.01, ****P < 0.0001; NS, not significant. 
Two-tailed paired t-test (a, b), two-tailed unpaired t-test (c, e, f) and 
Mann-Whitney test (d). 
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Figure 2 | Kir4.1 is expressed on astrocytic processes that wrap tightly 
around neuronal somata in LHb. a, Immunohistochemistry signals of 
Kir4.1 envelope neuronal somata as indicated by white arrows (three 
independent experiments). b, The pan-soma Kir4.1 signals remain intact 
in the LHb of Kir4.1 floxed mice injected with AAV-CaMKIla::eGFP-Cre, 
but are eliminated in mice injected with AAV-GFAP::eGFP-Cre (three 
independent experiments). c, Immunogold electron microscopy of Kir4.1. 
Red arrows indicate gold signals surrounding a neuronal soma (three 
independent experiments). d, I-V plots of the Ba”*-sensitive Kir4.1 
current recorded in LHb astrocytes and neurons. Top left, representative 
traces; bottom left, bar graph of current recorded when cells are held at 
—120 mV. Two-tailed Mann-Whitney test. Data are means + s.e.m. 

#EEP < 0.001. 


resting membrane potential (RMP) in astrocytes***°. To confirm that 
Kir4.1 function is indeed upregulated in rat models of depression, we 
performed whole-cell patch-clamp recordings onto LHb astrocytes 
from cLH or wild-type rats. Astrocytes were distinguished from neu- 
rons by their small (5-10,1m), oval somata and electrophysiological 
features (Extended Data Fig. 2a—d; see Methods). Biocytin filling 
and the absence of NeuN co-staining confirmed that cells fitting the 
above criteria were indeed astrocytes (Extended Data Fig. 3). We 
then bath-applied Ba** (BaCh, 100 uM), which selectively blocks Kir 
channels at sub-millimolar concentrations, to isolate Kir4.1 current”* 
(Extended Data Fig. 2a, c). The Ba**-sensitive current displayed a 
reversal potential close to the equilibrium potential of K* (Ex) 
(—90mV)734 (Fig. 1c, d), indicating that it represents the K* conduct- 
ance. We found that Ba?*-sensitive currents in LHb astrocytes were 
almost doubled in astrocytes from cLH (Fig. 1c) and LPS-treated rats 
(Extended Data Fig. 2f) at postnatal days (P)60-90. 

Notably, the increase in Kir4.1 current and protein level was not 
obvious in cLH rats at P30 (Fig. 1d, Extended Data Fig. le). At this age, 
cLH rats did not yet show depression-like phenotypes in either the FST 
(Fig. le) or the learned helplessness test (LHT; Fig. 1f), suggesting that 
the upregulation of Kir4.1 is concomitant with the developmental onset 
of the depression-like symptoms. 


Kir4.1 on astrocytes encircles neuronal soma 

As an inwardly rectifying K* channel, Kir4.1 has been strongly 
implicated in buffering excess extracellular K* in tripartite 
synapses”>-?8. The conventional model of Kt buffering suggests 
that Kir4.1 is highly expressed in astrocytic endfeet surrounding 
synapses”. Unexpectedly, immunohistochemical staining for Kir4.1 
in the LHb appeared to overlap with the neuronal marker NeuN at low 
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Figure 3 | Astrocytic kir4.1 overexpression increases neuronal bursts 

in the LHb and causes depression-like phenotypes. a, Top, schematics of 
AAV vectors engineered to overexpress Kir4.1 or a eGFP control under a 
GFAP promoter. Bottom, illustration of bilateral viral injection of 
AAV-GFAP::Kir4.1 in mouse LHb (stained with antibody against GFP and 
Hoechst) (four independent experiments). b, Experimental paradigm for 
electrophysiology and behavioural testing. c-e, Astrocytic overexpression 
of Kir4.1 decreases RMPs of both astrocytes (n = 10, 8 astrocytes from 4, 3 
mice for eGFP and Kir4.1, respectively, c) and neurons (n = 26, 29 neurons 
from 6, 6 mice, d) and increases the bursting population in neurons (e). 

f, g, Behavioural effects of expression of various viral constructs in the 
LHb in the FST (n=7, 8 mice, f) and SPT (n =6, 8 mice, g). Data are 
means +s.e.m. *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001. 
Two-tailed unpaired t-test (c, d, f, g) and x? test (e). 


magnification (20x; Extended Data Fig. 4a), although in the 
same brain slice Kir4.1 staining in the hippocampus showed a 
typical astrocytic appearance (Extended Data Fig. 4b). However, 
higher magnification imaging with single-layer scanning (0.76 1m 
per layer) revealed that Kir4.1 staining enveloped NeuN signals 
(Fig. 2a). To confirm that Kir4.1 was located within astrocytes but 
not neurons in the LHb, we separately knocked it out in either 
neurons or astrocytes by injecting AAV viruses expressing the Cre 
recombinase under either the neuronal Camk2a promoter or the 
glial GFAP promoter (gfaABCID)”?! into Kir4.1 floxed (Kenj10") 
mice**, Neuronal knockout of Kir4.1 expression did not change 
the pattern of Kir4.1 staining, but astrocytic knockout completely 
eliminated it (Fig. 2b). Electron microscopy imaging revealed that 
Kir4.1-positive gold particles encircled the membrane of neuronal 
cell bodies (Fig. 2c, Extended Data Fig. 5a, b) as well as synapses 
(Extended Data Fig. 5c). Consistently, whole-cell patch-clamp 
recordings showed that Ba”*-sensitive currents were absent from 
neurons but abundant in astrocytes in the LHb (Fig. 2d, Extended 
Data Fig. 6). Together these results suggest that Kir4.1 is mainly 
expressed in astrocytic processes that wrap tightly around neuronal 
somata and synapses in the LHb. 


Kir4.1 gain-of-function causes depression 

To test the consequences of Kir4.1 upregulation in the LHb, we 
used adeno-associated viruses of the 2/5 serotype (AAV2/5), which 
preferentially target astrocytes’, together with the human GFAP 
(gfaABC1D) promoter”! to deliver eGFP-tagged Kir4.1 channel 
(AAV-GFAP::Kir4.1) or eGFP alone (AAV-GFAP::eGFP) as a control 
(Fig. 3a). Twenty-one days after bilateral injection into the LHb at P50, 
AAV2/5-mediated viral transfection led to expression of Kir4.1 and 
eGFP in astrocytes throughout the LHb (Fig. 3a). The specificity of 
the viral infection in astrocytes was verified by co-immunostaining of 
NeuN and eGFP: only 0.3% of NeuN™ cells (1 =2,668) were infected 
by this virus (Extended Data Fig. 7a). We made whole-cell patch-clamp 
recordings from either astrocytes or neurons surrounding the virally 
transfected astrocytes in coronal LHb slices (Fig. 3b-e). The RMPs of 
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Figure 4 | Kir4.1-dependent potassium buffering regulates neuronal 
RMP and bursting in LHb. a, d, Changes in neuronal RMPs caused by 
BaCl, (100 1M, n =7, 11, 12 neurons from 3, 3, 3 rats for silent, bursting 
and tonic, respectively, a) or TTX (11M, n= 13, 9, 9 neurons from 2, 2, 2 
rats, d) in different neuronal types. RMPs were measured 15 min or 5 min 
after perfusion with BaCl or TTX, respectively, when the RMPs had 
stabilized. b, e, Changes in neuronal RMPs after BaCl; (n = 14, 9 neurons 
from 3, 3 rats for tonic and burst group, respectively, b) or TTX (n=9, 9 
neurons from 2, 2 rats, e) treatment correlate with original firing rates of 
tonic-firing neurons (blue) or intra-burst frequencies of bursting neurons 
(red). c, Representative trace (left) and bar graph (right, 9 neurons from 


both astrocytes and neurons were more hyperpolarized (Fig. 3c, d) and 
the percentage of bursting neurons was significantly higher (Fig. 3e, 
P<0.0001) in mice infected with AAV-GFAP::Kir4.1 than in mice 
infected with AAV-GFAP::eGFP. 

We then assayed depression-like phenotypes and found that mice with 
AAV-GFAP::Kir4.1 infection in the LHb displayed severe depression- 
like behaviours, including increased immobile duration and decreased 
latency to immobility in the FST (Fig. 3f), and decreased sucrose pref- 
erence in the SPT (Fig. 3g), while general locomotion was unchanged 
(Extended Data Fig. 7c, d). 


Kir4.1 regulates neuronal RMP and bursting 

How does an astrocytic K* channel regulate the RMP and burst 
firing of LHb neurons? We hypothesize that within the highly con- 
fined extracellular space between neuronal somata and Kir4.1- 
positive astrocytic processes (Fig. 2), the constantly released Kt 
from intrinsically active LHb neurons is quickly cleared by astro- 
cytes through a Kir4.1-dependent mechanism. Accordingly, we 
predict that blockade of Kir4.1 should compromise spatial buffering 
of K*, resulting in increased extracellular K* ({K] out), and accord- 
ing to the Nernst equation, depolarized neuronal RMPs (modelled 
in Extended Data Fig. 8). Consistent with this prediction, blockage 
of Kir4.1 by bath perfusion with BaCl, for about 10 min depolarized 
the RMPs of LHb neurons (Fig. 4a). The degree of change in RMP 
positively correlated with the original firing rates of neurons (Fig. 4b), 
indicating that the more active the neuron is, the larger contribution 
the K* buffering makes to its RMP. A similar amount of RMP 
change was induced when BaCl, was applied in the presence of 
synaptic transmission blockers (picrotoxin, 1,2,3,4-tetrahydro- 
6-nitro-2,3-dioxo-benzo[f]quinoxaline-7-sulfonamide (NBQX) and 
2-amino-5-phosphonopentanoic acid (AP5); Extended Data Fig. 9a-c), 
suggesting that Kir4.1-dependent regulation of RMP in LHb neurons 
occurs mostly at neuronal cell bodies instead of at synapses. Consequent 
to the RMP change and accumulation of [K]out, perfusion with BaCl 
caused either a shift in the spiking pattern from bursting to tonic firing 
and eventually to cessation of neuronal activity (5 of 9 neurons; Fig. 4c), 
or a prolonged plateau potential within a burst and eventually cessation 
of bursting firing (4 of 9 neurons; Extended Data Fig. 9d). 


Neuronal RMP (mV) 
b & 
oOo Oo 


3 rats) showing effect of BaCl, (15 min after drug perfusion) on bursting 
neurons. Spikes in bursting and tonic firing modes are shown in blue and 
black, respectively. f, Example of LHb neurons transforming from tonic- 
firing to burst-firing mode after [K] ou: is reduced from normal (2.75 mM) 
by 50% (to 1.4mM). g, h, Reducing [K]out by 50% decreases neuronal 
RMPs (n= 15 neurons from 4 rats, sampled 1 min after change in [K]out, g) 
and increases bursting neural population (n = 33, 40 neurons from 4, 6 rats 
for normal and reduced [K]out, respectively, h). Data are means + s.e.m. 
*P<0.05, **P< 0.01, ***P< 0.001, ****P < 0.0001. Two-tailed paired 
t-test (a, c, d, g), x? test (h) and linear regression test (b, e). 


By contrast, upregulation of Kir4.1 or prevention of neuronal fir- 
ing should decrease [K]out and hyperpolarize neuronal RMPs. Indeed, 
overexpression of Kir4.1 in astrocytes (Fig. 3d) or blockage of neuronal 
action potentials by tetrodotoxin (TTX; Fig. 4d, e) caused hyperpo- 
larization of LHb neurons. Overexpression of Kir4.1 in astrocytes 
also increased neuronal bursting (Fig. 3e). Finally, to assess a causal 
relationship between [K] out and firing mode, we made current-clamp 
recordings from LHb neurons while lowering [K] ou, by 50% (from 
2.75 mM to 1.4mM, Fig. 4f). This led to a reduction in neuronal RMP 
of 13.7+0.5mV (Fig. 4g) and a direct shift of originally tonic-firing 
neurons (8 out of 15) into bursting mode. Consequently, the percent- 
age of bursting neurons was increased from 8% to 23% (Fig. 4h). Thus, 
by increasing astrocytic Kir4.1 expression or decreasing extracellular 
Kt concentration, we were able to phenocopy in wild-type animals 
several key neuronal properties observed in the LHb of animal 
models of depression, namely hyperpolarized RMPs and enhanced 
bursts. These results indicate that enhanced extracellular Kt clearance 
resulting from overexpression of Kir4.1 might underlie the neuronal 
hyperpolarization required for burst initiation. 


Kir4.1 loss-of-function rescues depression 

To determine whether loss-of-function of Kir4.1 in the LHb could 
reduce neuronal bursts and reverse depressive phenotypes, we devised 
two strategies. We used AAV2/5 viral vectors to express either a short 
hairpin RNA (shRNA) to knock down the level of Kir4.1, or a dominant- 
negative construct to block its function, in the LHb of cLH rats (Fig. 5a). 
We tested six shRNAs that specifically targeted the Kir4.1 transcript 
in cell culture and chose the one with highest knockdown efficiency 
(Fig. 5b and Extended Data Fig. 10a) for viral packaging. To avoid an 
off-target effect of the shRNA, we also used a dominant-negative form 
of Kir4.1 (dnKir4.1), which contains a GYG-to-AAA point mutation 
at the channel pore, which blocks K* channels”* (Fig. 5a, c, Extended 
Data Figs 7b, 10b). We first examined the effect of Kir4.1-shRNA on 
glial and neural electrophysiological properties. In astrocytes infected 
with AAV-H1::Kir4.1-shRNA, we observed a marked change in the 
current-voltage (I-V) relationship (Fig. 5d) and a depolarization of 
41 mV compared with astrocytes infected with control shRNA (Fig. 5e), 
consistent with previous findings that Kir4.1 is mainly responsible for 
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Figure 5 | Loss of function of Kir4.1 in LHb decreases neuronal bursting 
and rescues depression-like phenotypes of cLH rats. a, Schematics of 
the AAV vector engineered to overexpress shRNA or dominant-negative 
Kir4.1. H1, human H1 promoter; CAG, CMV early enhancer/chicken 
Actb promoter. Three point mutations in the dnKir4.1 and their locations 
in the channel pore are shown below. b, Western blot and quantification 
showing efficient knockdown of Kir4.1 by shRNA in HEK293TN cells. 

c, Top, experimental paradigm for behavioural testing; bottom, illustration 
of bilateral expression of AAV-GFAP::dnKir4.1 in LHb (stained with 
antibodies against GFP and Hoechst). d-g, AAV-Kir4.1-shRNA caused 

a shift in reversal potential from —72 to —40 mV in astrocytes (d), 
depolarized astrocytes (e) and neurons in viral infected region (f), and 
abolished neuronal bursting (g). Floating bars for membrane slope 
conductance in d are calculated from the I-V plots (between —120 and 


setting astrocytic RMPs”*. The RMPs of neurons infected with AAV- 
H1::Kir4.1-shRNA did not differ from those of neighbouring non- 
infected neurons (because neurons do not express Kir4.1 endogenously, 
Fig. 5f). However, the RMPs of neighbouring LHb neurons in AAV- 
H1::Kir4.1-shRNA-infected brain slices were overall more depolarized 
than the RMPs of neurons from rats infected with the control shRNA 
(Fig. 5f), suggesting that knock-down of Kir4.1 in astrocytes had a 
widespread impact on the RMPs of neighbouring neurons. Similar 
effects were observed in LHb slices infected with AAV-GFAP::dnKir4.1 
(Extended Data Fig. 10c-f). Notably, bursting activity in the LHb of 
cLH rats was eliminated by infection with AAV-H1::Kir4.1-shRNA 
(Fig. 5g) or AAV-GFAP::dnKir4.1 (Extended Data Fig. 10f). 
Behaviourally, infection with AAV-H1::Kir4.1-shRNA or AAV- 
GFAP::dnKir4.1 caused a pronounced reduction in the depression-like 
phenotypes of cLH rats in three depression paradigms: it reduced 
immobility in the FST (Fig. 5h), increased bar pressing number 
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+40 mV). h-k, Behavioural effects of expression of various viral 
constructs in the LHb of cLH rats in the FST (h), LHT (i, j) and SPT (k). 

j, Percentage of rats in each category. LH, learned helpless rats with <5 
lever presses; NLH, non-learned helpless rats with > 10 lever presses. 

1, FST immobility is highly correlated with learned helplessness phenotype. 
m, A model for mechanisms of depression and ketamine treatment in the 
LHb. Upregulation of Kir4.1 on astrocytic processes surrounding neuronal 
somata leads to enhanced K* buffering at the tight neuron-glia junction, 
decreased [K] ou: and hyperpolarized neuronal RMP. Consequently, 
de-inactivation of T-VSCCs initiates NMDAR-dependent bursts, causing 

a stronger output of LHb to trigger depression. Ketamine blockade of 
NMDARs stops bursting and relieves depression. Data are means + s.e.m. 
*P<0.05, **P< 0.01, ***P < 0.001, ****P < 0.0001, NS, not significant. 
(See Supplementary Table 1 for statistical analysis and n numbers.) 


in the LHT (Fig. 5i, j), and increased sucrose preference in the SPT 
(Fig. 5k). The behaviour scores in the LHT correlated with those in 
the FST (Fig. 51). 


Concluding remarks 

Here we describe an important function of Kir4.1 in regulating 
neuronal RMP and firing pattern at the highly specialized neuron- 
glia interface in the LHb. During depression, upregulation of Kir4.1 
may cause enhanced extracellular K* clearance, leading to a decrease 
in [K] ou: and neuronal hyperpolarization (Fig. 5m). As demonstrated 
in the accompanying publication”, neuronal hyperpolarization may 
de-inactivate T-type voltage-sensitive calcium channels (T-VSCCs), 
which in turn initiate NMDAR-dependent bursts and thereby increase 
suppression of downstream monoaminergic centres (Fig. 5m). These 
results may inspire the development of new treatments for major 
depression targeting maladaptive neuron-glia interactions in the LHb. 
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We also expect that the perisomatic K* buffering mechanism described 
here may have a more widespread function. 
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METHODS 


Animals. Male cLH rats (3-4 weeks or 8-12 weeks of age) and age-matched male 
Sprague Dawley rats (SLAC Laboratory Animal Co.) were used. The cLH rats were 
screened using the learned helpless test!*! for breeding as previously described”). 
Male Wistar rats (SLAC Laboratory Animal Co, 12 weeks) were used for estab- 
lishing the LPS-induced depressive-like rat model. Male adult (7-8 weeks of age) 
C57BL/6 mice (SLAC) were used for virus injection in the behaviour experiments. 
Kir4.1 floxed (Kenj10”) mice (originally obtained from K. McCarthy at University 
of North Carolina) were used for virus injection in the immunohistochemistry 
experiments. Animals were group-housed two per cage for rats and four per cage 
for mice under a 12-h light-dark cycle (light on from 7a.m. to 7p.m.) with access 
to food and water ad libitum. All animal studies and experimental procedures 
were approved by the Animal Care and Use Committee of the animal facility at 
Zhejiang University. 

Western blotting. The habenular membrane fraction and whole proteins were 
extracted as previously described!*. Animals were anaesthetized using isoflurane, 
and habenular tissue was quickly dissected from the brain and homogenized in 
lysis buffer (320mM sucrose, 4mM HEPES pH 7.4, 1mM MgCl and 0.5mM 
CaCl, 5mM NaF, 1mM Na3VO,, EDTA-free, protease inhibitor cocktail tablets 
(Roche) on ice. The lysis buffer used for extracting the total protein of HEK293TN 
cells contained 50 mM Tris (pH 7.4), 150mM NaCl, 1% Triton X-100, 1% sodium 
deoxycholate, 0.1% SDS and protease inhibitor cocktail tablets (Roche). After 
protein concentration measurement by BCA assay, 10-20 1g protein for each lane 
was separated on a 10% SDS-PAGE gel and transferred for western blot analysis. 
Rabbit anti-Kir4.1 intracellular peptide (1:1,000, Alomone labs), mouse anti-GFAP 
(1:1,000, Sigma), mouse anti-c-tubulin (1:5,000, Sigma) and mouse anti-GAPDH- 
HRP (1:5,000, KangChen Bio-tech Inc.) antibodies, and high-sensitivity ECL 
reagent (GE Healthcare) were used. All the bands were analysed with Quantity 
One or Image J. 

Immunohistochemistry. Animals were anaesthetized using 10% chloral hydrate, 
and then perfused transcardially with ice-cold PBS (pH 7.4) followed by 4% para- 
formaldehyde. After overnight post fix in 4% paraformaldehyde solution, brains 
were cryoprotected in 30% sucrose for 1 day (for mice) or 3 days (for rats). Coronal 
sections (40|1m) were cut on a microtome (Leica) and collected in PBS and stored 
at 4°C for further use. The antibodies used were rabbit anti-Kir4.1 extracellular 
peptide (1:200, Alomone labs), mouse anti-GFAP (1:500, Sigma), mouse anti- 
NeuN (1:500, Millipore), rabbit anti-NeuN (1:500, Millipore), mouse anti-S100B 
(1:500, Sigma), chicken anti-GFP (1:1,000, Abcam), mouse anti-Flag (1:1,000, 
Beyotime), Alexa Fluor 488 goat anti-rabbit IgG, Alexa Fluor 488 goat anti-chicken 
IgG, Alexa Fluor 546 goat anti-mouse IgG (all 1:1,000, Invitrogen). Specifically, for 
Kir4.1 staining, the rabbit anti-Kir4.1 extracellular peptide antibody was incubated 
for 48-72 h and the other primary antibodies were incubated for 36-48 h. For 
the antibody absorption experiments, the rabbit anti-Kir4.1 extracellular peptide 
antibody was pre-adsorbed with the Kir4.1 extracellular antigen by mixing at a 
weight ratio of 1:2 for 24h. Slices for checking the injection site were counter- 
stained with Hoechst in the final incubation step. Fluorescent image acquisition 
was performed with an Olympus Fluoview FV1000 confocal microscope and a 
Nikon A1 confocal microscope. 

Cell transfection and cell culture. Human embryonic kidney (HEK293) cells (gift 
from J. Luo) were used for the electrophysiology recording and HEK293TN cells 
(Taitool Bioscience) were used for western blot analysis. Cells used in this study 
were authenticated and checked for mycoplasma contamination. The plasmids 
used were pAAV-Ubi-Kir4.1-2A-eGFP, pAAV-Ubi-dnKir4.1 (GYG to AAA)- 
2A-eGFP, pAAV-CAG-eGEP, pAAV-H1-Kir4.1-shRNA-CAG-eGFP and pAAV- 
H1-Luciferase-shRNA-CAG-eGFP. HEK293 or HEK293 TN cells were grown in 
DMEM (HyClone) with 10% FBS (HyClone) at 37°C and 5% CO). Cells were 
transfected with appropriate constructs 24 h after plating using Lipofectamine 
2000 (Invitrogen) according to the manufacturer's instructions. Culture medium 
was replaced 5-6h after transfection. Cells were cultured for another 48h before 
western blotting or electrophysiology experiments. 

Plasmid constructs. The pAAV-Ubi-Kir4.1-2A-eGFP plasmid was assem- 
bled by homologous recombination of an AAV backbone linearized from the 
AAV-Ubi-CaMKII-2A-eGFP plasmid™ by PCR and Kir4.1 amplified from 
a pZac2.1-gfaABC1D-eGFP-Kir4.1 plasmid (AddGene). The pAAV-Ubi-dnKir4.1 
(GYG to AAA)-2A-eGFP plasmid was made by PCR-based mutagenesis using 
pAAV-Ubi-Kir4.1-2A-eGFP as a backbone (fw: 5’-ACCATTGCCGCCGCC 
TTCCGCTACATCAGCGA-3’; rev: 5‘-GGCGGCGGCAATGGTGGTCTGG 
GATTCGAGGGA-3’). The pAAV-gfaABC1D- dnKir4.1(GYG to AAA)-2A-eGFP 
plasmid was assembled by homologous recombination of a pZac2.1-gfaABC1D 
backbone linearized from the pZac2.1-gfaABC1D-eGFP-Kir4.1 plasmid 
(AddGene) by PCR and a Kir4.1dn-2A-eGFP sequence amplified from the pAAV- 
Ubi-dnKir4.1(GYG to AAA)-2A-eGFP plasmid. The pAAV-H1-Kir4.1-shRNA- 
CAG-eGFP plasmid was constructed using a vector (Taitool Bioscience), which 


contains a CAG promoter driving eGFP and an H1 promoter driving shRNA 
expression. We designed six shRNA sequences using RNAi designer online 
software (http://rnaidesigner.thermofisher.com/rnaiexpress/; Invitrogen) as 
indicated below: 

1) 5‘-GGACGACCTTCATTGACAT-3/ 

2) 5’-GCTACAAGCTTCTGCTCTTCT-3’ 

3) 5’-GCTCTTCTCGCCAACCTTTAC-3’ 

4) 5‘-CCGGAACCTTCCTTGCAAA-3/ 

5) 5’-GCGTAAGAGTCTCCTCATTGG-3’ 

6) 5’‘-GCCCTTAGTGTGCGCATTA-3/ 

We then tested the knockdown efficiency by western blot of Kir4.1 from 

HEK293TN cells which were co-transfected with Flag-tagged-Kir4.1 plasmid 
(pAAV-CMV-betaGlobin-Kir4.1-eGFP-3Flag) and each of the six shRNA plas- 
mids. Based on our western blot result (Extended Data Fig. 10a), we chose the 
fifth sequence, 5’-GCGTAAGAGTCTCCTCATTGG-3’, for the Kir4.1-shRNA 
virus package. 
Electron microscopic immunohistochemistry. Four mice were deeply anaes- 
thetized with 1% sodium pentobarbital intraperitoneally (50 mg/kg body weight) 
and perfused transcardially with 20 ml saline, followed by 40 ml ice-cold mix- 
ture of 4% paraformaldehyde and 0.05% glutaraldehyde in 0.1 M PB for 1h. 
Brainstems were removed and postfixed by immersion in the same fixative for 4h 
at 4°C. Serial coronal sections of 501m thickness were prepared with a vibratome 
(VT 1000S, Leica), and approximately 18-20 sections, including the LHb region, 
were collected from each brain. 

Kir4.1 was detected by immunogold-silver staining. In brief, sections were 
blocked with blocking buffer (5% BSA, 5% NGS and 0.05% Triton X-100 in PBS), 
and then incubated overnight with primary antibodies (rabbit anti-Kir4.1 extra- 
cellular peptide; 1:200, Alomone labs) diluted with solution containing 1% BSA, 
1% NGS and 0.05% Triton X-100.The secondary antibody was anti-rabbit IgG 
conjugated to 1.4-nm gold particles (1:100, Nanoprobes) for 4h. After rinsing, sec- 
tions were post fixed in 2% glutaraldehyde in PBS for 45 min. Silver enhancement 
was performed in the dark with HQ Silver Kit (Nanoprobes) for visualization of 
Kir4.1 immunoreactivity. Before and after the silver enhancement step, sections 
were rinsed several times with deionized water. 

Immunolabelled sections were fixed with 0.5% osmium tetroxide in 0.1 M PB 

for 1h, dehydrated in graded ethanol series and then in propylene oxide, and finally 
flat-embedded in Epon 812 between sheets of plastic. After polymerization, acrylic 
sheets were then peeled from the polymerized resin, and flat-embedded sections 
were examined under the light microscope. Three to four sections containing 
Kir4.1 immunoreactivity in the LHb were selected from each brain, trimmed under 
a stereomicroscope, and glued onto blank resin stubs. Serial ultrathin sections were 
cut with an Ultramicrotome (Leica EM UC6, Germany) using a diamond knife 
(Diatome) and mounted on formvar-coated mesh grids (6-8 sections per grid). 
They were then counterstained with uranyl acetate and lead citrate, and observed 
under a JEM-1230 electron microscope (JEOL) equipped with a CCD camera and 
its application software (832 SC1000). 
Stereotaxic surgery and virus injection. cLH rats (P50-60) were deeply anaes- 
thetized using 4% pentobarbital. Mice (P50-60 days) were deeply anaesthetized 
using ketamine (100 mg/kg of body weight) and xylazine (8 mg/kg). Animals 
were placed on a stereotactic frame (RWD Instruments). A small volume of virus 
was injected into the LHb bilaterally (for rats LHb: AP, -3.7 mm from bregma; 
ML, + 0.7 mm; DV, -4.55 mm from the brain surface; for mice: AP, -1.72 mm from 
bregma; ML, + 0.46 mm; DV, -2.62 mm from the brain surface) using a pulled 
glass capillary with a pressure microinjector (Picospritzer III, Parker) at a slow 
rate of 0.111/min. After the injection was completed, the capillary was left for an 
additional 10 min before it was then slowly withdrawn completely. After surgery, 
animals were allowed to recover from anaesthesia under a heat pad. 

The following vectors were used: AAV-CaMKIla::eGFP-Cre (AAV2/1-CaMKIla- 
HI-eGFP-Cre, 0.211, bilateral into LHb, University of Pennsylvania vector core), 
AAV-GFAP::eGFP-Cre (AAV2/5-gfaABC1D-eGFP-Cre, titre: 4.74 x 10” v.g./ml, 
dilution: 1:2, 0.2 ul, bilateral into LHb, Taitool Bioscience), AAV-GFAP::Kir4.1 
(AAV2/5-gfaABC1D-eGFP-Kir4.1, titre: 9.19 x 10? v.g./ml, dilution: 1:5, 0.211, 
bilateral into LHb, Taitool Bioscience), AAV-GFAP::eGFP (AAV2/5-gfaABC1D- 
eGFP, titre: 1.61 x 10"? v.g./ml, dilution: 1:5, 0.21, bilateral into LHb, University 
of Massachusetts, Gao Laboratory), AAV-H1::Kir4.1-shRNA (AAV2/5-H1- 
Kir4.1-shRNA-CAG-eGEP titre: 3.04 x 10° v.g./ml, dilution: 1:10, 0.2 ul, bilateral 
into LHb, Taitool Bioscience), AAV-H1::Ctrl-shRNA (AAV2/5-H1-Luciferase- 
shRNA-CAG-eGEP, titre: 1.46 x 10!° v.g./ml, dilution: 1:5, 0.211, bilateral into 
LHb, Taitool Bioscience), AAV2/5-gfaABC1D-dnKir4.1-2A-eGFP(GYG to AAA) 
(titre: 4.15 x 10" v.g./ml, 0.2 ul, bilateral into LHb, Taitool Bioscience). All viral 
vectors were aliquotted and stored at -80°C until use. 

Depression model and behaviour assay. LPS-induced depression. The LPS- 
induced depression model was used as previously described”*. Wistar male rats 
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(3 months) were used for the experiments. LPS (Sigma, L-2880) dissolved in sterile 
0.9% saline was intraperitoneally injected into Wistar rats, at a dosage of 0.5 mg/kg. 
This dosage was used to stimulate a subclinical infection without inducing obvious 
inflammation and other apparent impairments in animals. Saline or LPS 
was injected between 09:30 and 10:30 a.m. daily for 7 days. The forced swim 
test was performed 24h after the last injection. The habenular tissue was 
dissected 24h after the behavioural test for western blotting or electrophysiology 
experiments. 
Learned helplessness test (LHT). Male juvenile (P30) or adult (P90) cLH rats 
were tested in a lever-pressing task to evaluate the learned helplessness (LH) 
phenotype". A cue-light-illuminated lever in the shock chamber was presented, 
which terminated the shock when rats pressed the lever. Fifteen escapable shocks 
(0.8 mA) were delivered with a 24-s inter-shock interval. Each shock lasted up to 
60s unless the rat pressed the lever to terminate the shock. Out of the 15 trials, 
rats that failed to press the lever for more than ten trials were defined as learned 
helpless (LH), and rats with fewer than five failures were defined as non-learned 
helpless (NLH). 
Forced swim test (FST). Animals were individually placed in a cylinder (12cm 
diameter, 25 cm height for mice; 20cm diameter, 50cm height for rats) of water 
(23-25 °C) and swam for 6 min under normal light*?. Water depth was set to 
prevent animals from touching the bottom with tails or hind limbs. Animal behav- 
iours were videotaped from the side. The immobility time during the last 4min 
of the test was counted offline by an observer blinded to the animal treatments. 
Immobility was defined as time when animals remained floating or motionless 
with only movements necessary for keeping balance in the water. For rats, an addi- 
tional pre-test was conducted 24h before the test, during which rats were individu- 
ally placed in a cylinder of water under the conditions described above for 15 min. 
Sucrose preference test (SPT). Animals were single housed and habituated with 
two bottles of water for 2 days, followed by two bottles of 2% sucrose for 2 days*’. 
Animals were then water deprived for 24h and then exposed to one bottle of 2% 
sucrose and one bottle of water for 2h in the dark phase. Bottle positions were 
switched after 1h. Total consumption of each fluid was measured and sucrose 
preference was defined as the average sucrose consumption ratio during the first 
and second hours. Sucrose consumption ratio was calculated by dividing total 
consumption of sucrose by total consumption of both water and sucrose. 
Open field test (OFT). Animals were placed in the centre of an arena 
(40cm x 40cm x 40.5cm for mice; 100 cm x 100cm x 40cm for rats) in a room 
with dim light for 10 min. A video camera positioned directly above the arena was 
used to track the movement of each animal (Any-maze, Stoelting). 
LHb brain slice preparation. Rats (P25-30 or P60-90) and mice (P90) were anaes- 
thetized with isoflurane and 10% chloral hydrate, and then perfused with 20 ml 
ice-cold ACSF (oxygenated with 95% O, + 5% CO2) containing (mM): 125 NaCl, 
2.5 KCI, 25 NaHCO3, 1.25 NaH2POg, 1 MgCl and 25 glucose, with 1 mM pyruvate 
added. The brain was removed as quickly as possible after decapitation and put 
into chilled and oxygenated ACSE. Coronal slices containing the habenula (350 1m 
and 300 1m thickness for rats and mice, respectively) were sectioned in cold ACSF 
with a Leica 2000 vibratome, transferred to ASCF at 32°C for incubation and 
recovery for 1h and then transferred to room temperature. ACSF was continu- 
ously gassed with 95% O, and 5% COs. Slices were allowed to recover for at least 
1h before recording. 
In vitro electrophysiological recording. For LHb neuron recordings, currents 
were measured under whole-cell patch-clamp recordings using pipettes with 
a typical resistance of 5-6 MQ filled with internal solution containing (mM) 
105 K-gluconate, 30 KCl, 4 Mg-ATP, 0.3 Na-GTP, 0.3 EGTA, 10 HEPES and 10 
Na-phosphocreatine, with pH set to 7.35. For the biocytin filling, 5 mg/ml bio- 
cytin was added in the internal solution. The external ACSF solution contained 
(in mM) 125 NaCl, 2.5 KCI, 25 NaHCOs, 1.25 NaH»PO,, 1 MgCh and 25 glucose. 
Cells were visualized with infrared optics on an upright microscope (BX51WI, 
Olympus). A MultiClamp 700B amplifier and pCLAMP 10 software were used for 
electrophysiology (Axon Instruments). The series resistance and capacitance were 
compensated automatically after a stable Gigaseal was formed. Spontaneous neu- 
ronal activity was recorded under current-clamp (I= 0 pA) for 60s consecutively. 
RMP was determined during the silent period of neuronal spontaneous activity. 
To test the effects of TTX (11M, Sigma) and BaCl, (100,1M, Sigma) on neuronal 
RMP and spontaneous activity, baselines were recorded for at least for 3 min. Then 
the drug was perfused into the medium with the arrival of the drug being precisely 
indicated with a bubble that was added before the transition from normal ACSF to 
ACSF with drug. The effect of TTX on the RMP and spontaneous activity of LHb 
neurons stabilized after a few minutes, whereas the effect of BaCl, took more than 
10 min to stabilize. We thus analysed the effects of TTX and BaClh at 5 and 15 min 
after perfusion started, respectively. 
Astrocytic whole-cell patch-clamp recordings and Kir4.1 current isolation. 
Astrocytes were distinguished from neurons by their small (5-10 1m), oval somata 
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and distinct electrophysiological features: a hyperpolarized RMP (—74+1mV), a 
low input resistance Ri, (47 + 6 MQ), a linear J-V relationship and an absence of 
action potentials in response to increased injection currents***°. BaCl, (100M, 
Sigma) was applied to isolate Kir4.1 current, which was subtracted from the 
I-V curve recorded from —120 mV to 0 mV. Biocytin (Sigma, 5 mg/ml) was dis- 
solved into the patch-clamp pipette solution. After electrophysiological charac- 
terization, cells were held for at least 30 min in voltage clamp and constantly 
injected with a hyperpolarization current (500 ms, 50 pA, 0.5 Hz, 30 min) to allow 
biocytin filling (performed at 34°C). Subsequently, slices were fixed overnight 
in 4% paraformaldehyde at 4°C. The antibodies used were rabbit anti-NeuN 
(1:500, Millipore), Alexa Fluor 546 donkey anti-rabbit IgG (1:1,000, Invitrogen) 
and Cy2-conjugated streptavidin (1:1,000, Jackson ImmunoResearch). 
Fluorescent image acquisition was performed with a Nikon Al confocal 
microscope. 

Tri-compartment model. A tri-compartment model was constructed including 
the neuron, the astrocyte and the extracellular space. The model was based on 
ionic fluxes between these three compartments. Na‘ and K* voltage-gated chan- 
nels, and leak channels were recruited into the neuron as dVy/dt = Japp — Ix — 
Ta — Iteak.n)/CN» Where Vy is the neuronal membrane potential, cy is the neuronal 
capacitance, Ip» is an external current applied to the neuron, Ina and Ix are the fast 
Na‘ and K* currents of the action potentials, respectively, and I,eak.n is responsible 
for the neuronal resting membrane potential. To trigger an action potential, we 
applied an external step current of amplitude 5 nA and duration 0.1 ms. Kir4.1 
channels on the extracellular and vessel side and leak channels were recruited into 
the astrocyte as dVa/dt=(—Ikir — Tkir_vess — Iteak,a)/Ca, Where Va is the astrocytic 
membrane potential, cq is the astrocytic capacitance, Iki, and Ikir yess account for the 
K* flow on the side of the neuron and vessel, respectively (Iki: is outward during 
the resting state, and becomes inward when K* equilibrium potential exceeds the 
astrocytic membrane potential), and Ip eak,a is responsible for the astrocytic resting 
membrane potential. 

The dynamics for K* in the three compartments are described by: 


d[{K*], /dt = (Txir a Ix) /(F x Volo) = 2(ipump,N + ipump,A) 
—dx_o([K*], — Ko_o) 


d[K*], /dt =— Ik/(F x Voly) + 2ipump,N X Volo/Voln 


— dx_n([K*],, — Kn_o) 


d{K*], /dt = (—Ikir = Txir_vess) /(F x Vola) + 2ipump,A x Volo/Vola 
— dx_a([K*], — Ka_o) 


where [K*]o, [Kt] and [K*], are the extracellular, neuronal and astrocytic K* 
concentrations, respectively; F is the Faraday constant; Volo, Vol, and Voly are the 
volumes of the extracellular, astrocytic and neuronal compartments, respectively; 
and ipump, aNd ipump,a are the fluxes through the Na*/K* pump of the neuron and 
astrocyte membranes, respectively. The term d([X] — [Xo]) describes the diffusion 
of Kt in the considered compartment, where dx_o, dx_~ and dx 4 are the rates of 
extracellular, neuronal and astrocytic K* effective flux; Ko 0, Kn_o,and Ka_o are the 
extracellular, neuronal and astrocytic K* concentrations expected at equilibrium, 
respectively. The extracellular K* is mainly contributed by Ix. Released extracel- 
lular K* from the neuron is taken up by Kir4.1 channels and Nat/K* pumps. 
The dynamics for Nat in the three compartments are described by: 


d[Na‘Jo /dt= Tya/(F x Volo) + 3(ipump,N + ipump,A) 
— dya_o([Na‘]o —Nao_o) 


d{Na‘],, /dt = Tya/(F x Vol) _ 3ipump,N x Volo /Voly 
— dya_n([Na‘], — Nan_o) 


d[Na‘], /dt =— 3ipump,A x Volo/Vola —dya_a([Na' Ja —Naa_o) 


See Supplementary Information (Model Description and Supplementary Table 2) 
for detailed formulae. 

Statistical analyses. The required sample sizes were estimated on the basis of our 
past experience performing similar experiments. Animals were randomly assigned 
to treatment groups. Analyses were performed blinded to treatment assignments in 
all behavioural experiments. Statistical analyses were performed using GraphPad 
Prism software v6. By pre-established criteria, values were excluded from the 
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analyses if the viral injection or drug delivering sites were out of the LHb. All 
statistical tests were two-tailed, and significance was assigned at P < 0.05. 
Normality and equal variances between group samples were assessed using the 
D'Agostino and Pearson omnibus normality test and Brown-Forsythe tests, 
respectively. When normality and equal variance between sample groups was 
achieved, one-way ANOVAs (followed by Bonferroni’s multiple comparisons 
test) or t-tests were used. Where normality or equal variance of samples failed, 
Mann-Whitney U tests were performed. Linear regression tests and x? tests were 
used in appropriate situations. The sample sizes, specific statistical tests used, 


and main effects of our statistical analyses for each experiment are reported in 
Supplementary Table 1. 

Data availability. The data that support the findings of this study are available 
from the corresponding author upon reasonable request. 
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Extended Data Figure 1 | Habenular protein expression in rat models GFAP protein in habenulae of cLH rats at P60-90. n = 4, 4 rats for control 
of depression. a, Volcano plot of high-throughput proteomic screen and cLH, respectively. c, LPS injection (500 jg kg”! ip. for 7 days) induces 
identifies proteins that are differentially expressed in the habenulae of increased immobile time and decreased latency to immobility in the FST. 
cLH rats versus wild-type rats. Ln (fold change) is In-transformed value n=8, 9 rats for saline and LPS, respectively. d, QPCR analysis of Kir4.1 
of the normalized protein ratio of cLH and control’. Significance value mRNA in habenulae. Two-tailed paired t-test. n =5, 5 rats for control and 
was calculated as the average normalized ratio minus two folds of s.d."4. cLH, respectively. e, Western blot analysis showing no change in Kir4.1 
Proteins in the shaded areas have more than 50% significant change. protein in membrane fraction of habenulae in cLH rats at P30. n=6, 6. 
Kir4.1 is one of the eight upregulated proteins identified!*. Dashed lines Data are means +s.e.m., *P< 0.05, n.s., not significant. Two-tailed paired 


indicate fold change of 50%. b, Western blot analysis showing no change in __ t-test (b, d, e); two-tailed unpaired t-test (c). 
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Extended Data Figure 2 | Ba”*-sensitive Kir4.1 current is upregulated and controls at P60-90 (b) and P30 (d). e, I-V plots of astrocytes in LPS- 
in LHb of adult cLH rats and adult LPS-injected Wistar rats. injected Wistar rats and saline controls at P60-90. f, I-V plot and bar graph 
a, c, Representative traces showing linear J-V curve in a typical astrocyte showing Ba”*-sensitive currents in LPS-injected Wistar rats and saline- 
before (upper) and after (middle) Ba** perfusion under voltage steps injected controls at P60-90. Two-tailed unpaired t-test. n= 7, 6 astrocytes 
(-130 mV to -30 mV, step by 10 mV, 2s duration, holding at -70 mV). from 2, 2 rats for saline and LPS, respectively. Data are means + s.e.m., 


Subtraction of the two led to Ba”*-sensitive Kir current (bottom) at *P<0.05. 
P60-90 (a) and P30 (c) in cLH rats. b, d, I-V plots of astrocytes in cLH rats 
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Extended Data Figure 3 | Biocytin intercellular filling and double 
staining with NeuN confirm the identity of eletrophysiologically 
identified neurons and astrocytes. a, b, A neuron (a) and an astrocyte (b) 
in LHb slices were first identified on the basis of their specific morphology 
(astrocytes: 5-10 ,1m diameter; neurons: ~15 1m diameter) and 
physiological properties. The neuron fires at a depolarizing voltage step (a), 
whereas the astrocyte shows a steady-state J-V relationship and a lack of 
spiking activity (b). After electrophysiological characterization, cells were 
held for at least 30 min in voltage clamp and constantly injected with a 
hyperpolarization current (500 ms, 50 pA, 0.5 Hz, 30 min) to allow biocytin 
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filling. c—h, Biocytin-labelled neurons and astrocytes subsequently 
confirmed by co-labelling with NeuN. ¢, d, Biocytin signals in a single 
neuron (c) or a group of astrocytes owing to diffusion through gap 
junctions (d) (four independent experiments). e, f, NeuN signals 

(four independent experiments). g, h, Colabelling of NeuN with the 
neuron (indicated by white arrow, g) but not astrocytes (h) (four 
independent experiments). Note that all biocytin-filled neurons (n = 18) 
colabel with NeuN and all biocytin-filled astrocytes (n = 11) do not colabel 
with NeuN. 
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hippocampus (b). Bottom two panels show staining with the same Kir4.1 
antibody pre-incubated with the antigen peptide, demonstrating the 
specificity of the Kir4.1 antibody (two independent experiments). 


Extended Data Figure 4 | Expression pattern of Kir4.1 in the LHb and 
hippocampus. a, b, Kir4.1 co-immunostaining with neuronal marker 
(NeuN) or astrocytic marker ($100b and GFAP) in the LHb (a) or 
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Extended Data Figure 5 | Electron microscopy immunohistochemistry the postsynaptic densities (arrowheads). d, Kir4.1 immunograins are also 


of Kir4.1 staining. a, b, Many Kir4.1 immunograins (arrows) surround detected surrounding a vascular endothelial cell. e, Inset shows Kir4.1 
the neuronal soma. c, Kir4.1 grains (arrows) also surround axon- immunograins near a gap junction. s, neuronal soma; t, axon terminal. 
dendrite synapses, but are rare near the synaptic zones as indicated by Scale bars, 0.5 |1m. Three independent experiments. 
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Extended Data Figure 6 | Kir4.1 is expressed in astrocytes but not 
neurons in the LHb. a, Schematics showing sequence of drug application 
and recording after a neuron or astrocyte is patched. b, Representative 
traces showing a linear I-V curve in a typical astrocyte under voltage steps 
(—130mV to —30 mV, step by 10 mV, 2s duration, holding at —70 mV, 
protocol demonstrated on left, upper panel). J-V curves of the same 

cell after addition of TTX (11M), 4-aminopyridine (4AP, 1 mM) and 
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c 
Ls 
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400 ms 


4-(N-ethyl-N-phenylamino)-1,2-dimethyl-6-(methylamino) pyrimidinium 
chloride (ZD7288, 50|1M) (middle) and further addition of Ba?* (100M, 
bottom) are shown below. c, Representative traces showing a nonlinear J-V 
curve in a typical neuron under voltage steps (—120 mV to —40 mV, step 

by 10 mV, 2s duration, holding at —60 mV, protocol demonstrated on left, 
upper panel). I-V curves of the same cell after addition of TTX, ZD7288 and 
4AP (middle) and further addition of Ba?* (bottom) are shown below. 
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Extended Data Figure 7 | Characterization of cell-type specificity of 
GFAP promoter, and locomotion. a, b, Double immunofluorescence for 
NeuN (red) and eGFP (green) in the coronal section of LHb brain slices 
infected with AAV-GFAP::Kir4.1 (AAV2/5-gfaABC1D-eGFP-Kir4.1) 
virus in mice (three independent experiments, a) or AAV-GFAP::dnKir4.1 
(AAV2/5-gfaABC1D-dnKir4.1-2A-eGFP) virus in cLH rats (two 
independent experiments, b). Left, examples of anterior, middle and 
posterior coronal sections of LHb. Numbers in the bottom right corner are 
the number of merged cells/number of NeuN* cells in the virus-infected 
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Total distance (m) 
Total distance (m) 
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area. Right, zoomed-in images of the white square area in left. Note that 
there is only one infected neuron, as indicated by the white arrow, in all 
three fields of view. c, d, Overexpression of Kir4.1 in the LHb of C57 mice 
does not affect locomotion. n =7, 8 mice for eGFP and Kir4.1, respectively. 
e, f, Overexpression of Kir4.1-shRNA in the LHb of cLH rats does not 
affect locomotion activities. n =7, 7 rats for control and Kir4.1 shRNA, 
respectively. Data are means + s.e.m; n.s., not significant. Two-tailed 
unpaired t-test (c-f). 
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Extended Data Figure 8 | Simulation of the dynamic potassium 
buffering effect of Kir4.1 in the tri-compartment model. a, Schematic 
representation of a tri-compartment model involving neuron, astrocyte 
and extracellular space (see Methods for details). b-d, Effects of increasing 
Kir4.1 expression on [K] ut (b), neuronal membrane potential (c) and 
astrocytic membrane potential (d). Ctrl, control condition with 1 x Kir4.1 
conductance; Depr, depression condition with 2 x Kir4.1 conductance. 
Grey shaded areas indicate application of 10 Hz tonic stimulation to 
neurons. Note that under this neuronal firing condition, [K] ou: is lower, 


and neuron and astrocyte are more hyperpolarized in the depression 
condition than the control. e-g, Effects of in silico TTX (blocking action 
potentials, gvq=0) or Ba*+ (blocking Kir4.1, gxira.1 = 0) treatments on 
[K]out (e), neuronal membrane potential (f) and astrocytic membrane 
potential (g) when neurons are under 10 Hz tonic stimulation. Grey 
shaded areas indicate in silico application of drugs. Note that TTX and 
Ba?* cause opposite changes to [K]ou:, neuronal membrane potential and 
astrocytic membrane potential. Neuronal spikes are not shown for clarity 
of presentation. 
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Extended Data Figure 9 | BaCl, caused depolarization of neuronal RMP 
in the presence of synaptic transmitter blockers. a, b, Representative 
trace (a) and bar graph (n=9 neurons from 3 rats; b) showing effect of 
BaCl, (100 1M) perfusion onto tonic-firing neurons that have been bathed 
with transmitter blockers (100 1M picrotoxin, 10 1M NBQX and 

100\1M AP5). c, Bar graph showing the level of RMP depolarization 
caused by BaCl, in the presence or absence of transmitter blockers. 
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n=9, 12 neurons from 3, 3 rats for with and without blockers, respectively. 
d, Representative trace showing effect of BaCl, (sampled 15 min after 

drug perfusion) on bursting neurons (n = 4 out of 9 neurons from 3 rats). 
Spikes in bursting and tonic-firing mode are shown in blue and black, 
respectively. Data are means + s.e.m., ***P < 0.001, n.s., not significant. 
Two-tailed paired t-test (b) and two-tailed unpaired t-test (c). 
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Extended Data Figure 10 | Characterization of Kir4.1 loss-of-function for eGFP and dnKir4.1, respectively. c, I-V plot and bar graph showing 
constructs. a, Flag-tagged-Kir4.1 plasmid (pAAV-CMV-betaGlobin- Ba**-sensitive currents blocked by AAV-dnKir4.1 in both cLH and wild- 
Kir4.1-eGFP-3Flag) was co-transfected with pAAV-vector expressing type rats. d-f, AAV-dnKir4.1 caused depolarization of RMP in astrocytes 
six different shRNAs (see Methods) of Kir4.1 or the negative control (n=9, 8, 9, 6 astrocytes from 2, 4, 2, 4 rats for wild-type eGFP, wild-type 
(shRNA of luciferase) into HEK293 TN cells. On the basis of knockdown dnKir4.1, cCLH eGFP and cLH dnKir4.1, respectively; d) and neurons in 
efficiency as shown in the western blot, Kir4.1-shRNA-5 was chosen for viral infected area (n = 54, 48, 45, 58 neurons from 2, 4, 2, 4 rats, e), and 
viral package (two independent experiments). b, J-V plot showing Kir4.1 abolished neuronal bursting (f) in both cLH and wild-type rats. Data are 
currents recorded in HEK293 cells transfected with pAAV-Kir4.1 together = means +s.e.m.; *P < 0.05, **P< 0.01, aed <0.001, ****P < 0.0001. 
with negative control pAAV-eGFP or pAAV-dnKir4.1 plasmid. Bars Two-tailed unpaired t-test (b-e) and y? test (f). 

represent the current values recorded at -160 mV. n= 18, 15 HEK293 cells 
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Teneurin-3 controls topographic circuit 
assembly in the hippocampus 


Dominic S. Berns!**, Laura A. DeNardo!?, Daniel T. Pederick!? & Liqun Luo? 


Brain functions rely on specific patterns of connectivity. Teneurins are evolutionarily conserved transmembrane proteins 
that instruct synaptic partner matching in Drosophila and are required for vertebrate visual system development. The 
roles of vertebrate teneurins in connectivity beyond the visual system remain largely unknown and their mechanisms 
of action have not been demonstrated. Here we show that mouse teneurin-3 is expressed in multiple topographically 
interconnected areas of the hippocampal region, including proximal CA1, distal subiculum, and medial entorhinal cortex. 
Viral-genetic analyses reveal that teneurin-3 is required in both CA1 and subicular neurons for the precise targeting of 
proximal CAI axons to distal subiculum. Furthermore, teneurin-3 promotes homophilic adhesion in vitro in a splicing 
isoform-dependent manner. These findings demonstrate striking genetic heterogeneity across multiple hippocampal 
areas and suggest that teneurin-3 may orchestrate the assembly of a complex distributed circuit in the mammalian brain 


via matching expression and homophilic attraction. 


The hippocampal region is critical for the acquisition of declarative 
memory and the neural representation of space’~*. The connections 
between hippocampal subregions and adjacent cortex are topograph- 
ically organized along both the dorsal-ventral and proximal-distal 
axes>. Along the proximal distal axis, proximal CA1, distal subicu- 
lum, and medial entorhinal cortex (MEC) neurons are specifically 
interconnected, as are distal CA1, proximal subiculum, and lateral 
entorhinal cortex (LEC) neurons®’. These two parallel circuits may 
be preferentially used for processing spatial and object-related infor- 
mation, respectively®. Genetic heterogeneity that could contribute 
to the observed anatomical and functional differentiation along 
the proximal-distal axis in CA1 has been reported®. However, the 
mechanisms that control the exquisite wiring specificity remain 
unknown. 

Since the proposal of the chemoaffinity hypothesis for establish- 
ing specific neuronal connections’°, many cell surface and secreted 
proteins have been discovered that guide developing axons to target 
regions and recognize specific synaptic partners'»'”. Members of the 
teneurin family of type I transmembrane proteins exhibit matching 
expression in pre- and postsynaptic partners and instruct synaptic 
partner choice in the Drosophila olfactory and neuromuscular sys- 
tems, probably through homophilic attraction'*'*. Teneurins are 
evolutionarily conserved, with four members in mammals!> that are 
dynamically expressed during brain development'®. Human teneurins 
are risk loci in bipolar disorder'’~”° and schizophrenia”!, and are 
implicated in other neurological disorders”””*. Teneurin-3 (Ten3) is 
required for proper dendrite morphogenesis and axon targeting in 
the vertebrate visual system”*’. Although Ten3 has been hypothe- 
sized to function as a homophilic attractant, no cellular or molecular 
mechanisms have been demonstrated. Furthermore, conflicting evi- 
dence exists as to whether vertebrate teneurins interact in trans ina 
homophilic manner?*°, and heterophilic trans interactions with the 
adhesion-type G-protein-coupled receptors latrophilins have been 
demonstrated*°-**. Here, we examine the role of Ten3 in the devel- 
opment of specific connections within the hippocampal region, and 
shed new light on its mechanism of action during mammalian neural 
development. 


Ten3 expression in hippocampal regions 

Using a custom antibody against a cytoplasmic epitope (Extended 
Data Fig. 1a), we found that Ten3 was expressed in highly specific 
regions of the postnatal brain. In particular, Ten3 was expressed in 
restricted domains of the developing hippocampal region (Fig. 1a), 
including proximal CA1, distal subiculum, and MEC. A second Ten3 
antibody against an extracellular epitope recapitulated this staining 
pattern (Extended Data Fig. 1a, d). Staining with both antibodies 
was abolished in Ten3 knockout mice (Ten3““/“*)4 (Extended Data 
Fig. 1b-e). Ten3 was most prominent in synaptic layers, including 
stratum lacunosum-moleculare of CA1 and the molecular layer of 
subiculum, consistent with Ten3 being present in the synaptic cleft**. 
Ten3 was also present in axons, dendrites, and cell bodies (Extended 
Data Fig. 1f-i). In situ hybridization revealed that Ten3 mRNA was 
expressed in all regions where Ten3 protein was observed (Fig. 1b 
and Extended Data Fig. 1j). In both CA1 and subiculum, Ten3 mRNA 
showed a graded distribution along the proximal-distal axis, peaking in 
proximal CA1 and distal subiculum (Fig. 1c and Extended Data Fig. 2). 

Remarkably, Ten3 protein and mRNA expression patterns cor- 
responded to the known topography of multiple connections in the 
hippocampal region. MEC neurons send axons to proximal CA1 
and distal subiculum****, proximal CA1 neurons project to distal 
subiculum?”** and MEC®”, and distal subicular neurons project to 
MEC®*”, All of these regions highly expressed Ten3 protein and mRNA 
(Fig. la—c). By contrast, LEC neurons are interconnected with distal 
CA1 and proximal subiculum®”**™, all of which expressed low or no 
Ten3. To further examine the relationship between Ten3 expression 
and topographic projections, we injected an anterograde viral tracer 
into MEC, and found that MEC axons and Ten3 protein clearly over- 
lapped in the molecular layers of proximal CA1 and distal subiculum 
(Fig. 1d). By contrast, LEC axons projected to distal CA1 and proximal 
subiculum, regions of low Ten3 expression (Fig. le). 

In summary, Ten3 expression matches with topographic connec- 
tivity between entorhinal cortex, CA1, and subiculum (Fig. 1f). Ten3 
protein and mRNA were also specifically expressed in subregions of 
the presubiculum, parasubiculum, medial mammillary nucleus, and 
anteroventral thalamic nucleus that are topographically connected 
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Figure 1 | Ten3 expression in the developing hippocampal region. 

a, Left: diagram of the hippocampal region on a horizontal section of 
P10 mouse brain. A, anterior; P, posterior; M, medial; L, lateral. Right: 
same section with Ten3 immunostaining. b, In situ hybridization for Ten3 
mRNA on a P9 horizontal section. In a and b, arrows denote proximal 
CA1; arrowheads, distal subiculum; asterisk, MEC. c, Quantification 

of Ten3 mRNA along the proximal-distal (P-D) axis of CA1 (n= 12 
sections, four mice) and subiculum (n= 14 sections, four mice) of P10 
horizontal sections. Insets: Ten3 mRNA (top) and DAPI (4’,6-diamidino- 
2-phenylindole) staining (bottom). The x axis represents bin along the 
proximal-distal axis of CA1 or subiculum. Shaded curves, mean + s.e.m. 


with subiculum or entorhinal cortex (Extended Data Fig. 3). Given 
the function of Drosophila teneurins in synaptic partner matching!?4, 
we hypothesized that Ten3 may act as a homophilic attractant to control 


the development of these precise wiring patterns. 


Ten3 knockout analysis 

To test the function of Ten3 in hippocampal circuit development, we 
focused on the projection from CA1 to subiculum, where proximal 
CAI neurons (Ten3-high) send axons to distal subiculum (Ten3- 
high), and distal CA1 neurons (Ten3-low) project to proximal subicu- 
lum (Ten3-low) (Fig. 2a—-c). We generated a Ten3 knock-in cre allele 
(Ten3°, Extended Data Fig. 4a), which functioned as a protein null 
and allowed us to visualize neurons that normally express Ten3 in 
a mutant background (Extended Data Fig. 4b-d). We injected the 
anterograde tracer Phaseolus vulgaris leucoagglutinin (PHA-L)*! 
into proximal CA1 of the dorsal hippocampus of Ten3 heterozygous 
(Ten3*'*, hereafter Ten3"') and knockout (Ten3°’“4, hereafter 
Ten3*°) mice, and analysed the distribution of labelled CA1 axons 
in subiculum (Fig. 2c-g). In Ten3" mice, proximal CA1-restricted 
injections labelled axons that arborized densely in distal subiculum, 
confirming the topography described in the rat*”** (Fig. 2c). In 
Ten3*®° mice, however, similarly placed injections in proximal CA1 
labelled axons that spread significantly more proximally in subicu- 
lum (Fig. 2d, e), with increased overall width (Fig. 2f) and a proxi- 
mal shift in the mean position (Fig. 2g) of the projection. Middle 
and distal CA1 injections revealed that the overall topography of the 
CA1—subiculum projection was less sharp in Ten3*° mice (Extended 
Data Fig. 5). Thus, Ten3 is required for the precise projections of 
proximal CA1 axons to distal subiculum. 
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d, Labelling of MEC axons projecting to hippocampus in P70 brain after 
AAV1-CMV-GFP injection in MEC (asterisk) (left) and Ten3 staining on 
the same section (right). GFP in layer III MEC axons overlaps with Ten3 in 
proximal CA1 (arrows) and distal subiculum (arrowheads); GFP in layer 
II MEC axons also overlaps with Ten3 in dentate gyrus (#). e, Labelling 

of LEC axons in P55 brain projecting to distal CA1 (arrow) and proximal 
subiculum (arrowhead). Asterisk identifies track to the more ventral 
injection site. f, Summary of topographic connections between MEC, 
proximal CA1, and distal subiculum (Sub) (green arrows), and between 
LEC, distal CA1, and proximal subiculum (red arrows). Scale bars, 200 jum. 


We next recorded excitatory postsynaptic currents (EPSCs) from 
proximal and distal subicular cells in hippocampal slices from Ten3*/t 
(Ten3“") and Ten3*° mice in response to electrical stimulation 
of the CAl—subiculum axon bundles (Fig. 2h). In Ten3’ slices, 
stimulation reliably evoked EPSCs in proximal and distal subicular 
neurons (Fig. 2i). In Ten3®° mice, evoked EPSC amplitudes in distal 
but not proximal subicular neurons were strongly reduced, consistent 
with the anatomical defect in Ten3*° mice (Fig. 2i, j). The paired pulse 
ratio was unaffected in proximal subicular neurons but was increased 
in distal subicular knockout cells, indicating a lower baseline release 
probability in Ten3 mutants (Fig. 2k). Given that, in knockout mice, 
some proximal CA1 axons still reached distal subiculum (Fig. 2d, e), the 
marked physiological changes suggest that Ten3 may also be required 
for formation or function of the synapses between proximal CA1 and 
distal subicular neurons, consistent with the persistent expression of 
Ten3 in adults (Fig. 1d) and the function of Drosophila teneurins in 
synapse formation'*””. 


CAI conditional knockout analysis 
To distinguish whether Ten3 is required in CA1, subiculum, or both, 
for the CA1l—subiculum projection, we generated a conditional Ten3 
allele (Ten3") that allowed us to delete Ten3 in cells expressing Cre 
recombinase (Extended Data Fig. 6). Because the CAl—subiculum 
projection develops postnatally (Extended Data Fig. 7), we injected 
cre-expressing lentivirus into either CA1 or subiculum at postnatal day 
0 (PO) to generate area-specific Ten3 knockouts. 

For CA1 conditional knockout, we injected a Cre-dependent antero- 
grade tracer into proximal CA1 of adults to visualize axons only from 
Cre-expressing neurons (Fig. 3a). In Ten3“" controls, proximal CA1 
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Figure 2 | Ten3 is required for the precise CA1—>subiculum projection. 


a, Sagittal section of P11 hippocampus, showing Ten3 expression in 
proximal CA1 and distal subiculum. A, anterior; P, posterior; D, dorsal; 
V, ventral. b, Diagram of the CAl—subiculum topographic projection, 
with Ten3-high regions and axons in red. c, d, PHA-L (green) injections in 
proximal CA1 and corresponding projections in subiculum of Ten3" (c) 
or Ten3*° (d) mice. e, Averaged normalized fluorescence intensity traces 
for proximal injections in CA1 (left) and corresponding projections in 
subiculum (right) for Ten3"' (black, n= 12 mice) and Ten3*° (red, n= 16 
mice). The x axis represents bin along the proximal-distal axis of CA1 

or subiculum. Shaded curves represent mean + s.e.m. at each bin. 

f, g, Projection width-at-half-maximum (f) and mean position (g) 

for Ten3"™ and Ten3*°, ****P < 0.0001 (Ten3™': n= 12; Ten3*°: n= 16; 
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two-tailed t-test). h, Diagram of stimulating electrode and recording 
sites. i, EPSC traces from proximal and distal subicular cells in Ten3 Wr 
and Ten3*° mice. j, Average EPSC amplitude in proximal (left) and 
distal (right) subicular cells from Ten3"7 and Ten3*° mice at increasing 
stimulation intensities. Proximal: Ten3“7, n= 12 cells; Ten3*°, n= 11 
cells; P> 0.05 for all stimulation intensities. Distal: Ten3”", n= 14 cells, 
6 mice; Ten3*°, n=9 cells, 7 mice; 75|1A stimulation, P= 0.035; 100 pA 
stimulation, P = 0.022, adjusted P values from two-tailed t-tests with 
Holm- Sidak’s correction. k, Paired pulse ratio for proximal (left) and 
distal (right) subicular cells from Ten3" and Ten3*° mice. (Proximal: 
Ten3"", n=9 cells, 6 mice; Ten3*°, n =8 cells, 7 mice; P= 0.5566. Distal, 
Ten3?: n=11 cells, 6 mice; Ten3*°: n=7 cells, 7 mice; P=0.0049, two- 
tailed t-test.) Scale bars, 200 1m. Error bars, mean + s.e.m. 
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Figure 3 | Ten3 conditional knockout in CA1. a, Experimental scheme. 
b, c, Adeno-associated virus (AAV) injections in proximal CA1 (left) 
and corresponding projections in subiculum (right) of Ten37 (b) and 
Tens! (c) mice. Red, injections; green, projections; white, Cre staining. 
d, Average normalized fluorescence intensity traces for proximal CA1 


Proximal «— Subiculum—- Distal 


injections (left) and corresponding subicular projections (right) for 
control (black, n= 14) and CA1 conditional knockout (red, n = 9) mice. 
e, f, Projection width-at-half-maximum (e) and mean position (f) for 
control and knockout mice. ***P = 0.0001, ****P < 0.0001, two-tailed 
t-tests. Scale bars, 200 um. Error bars, mean + s.e.m. 
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Figure 4 | Ten3 conditional knockout in subiculum. a, Experimental 
scheme. b, e, Example images from Ten3"? (b) and Ten3 (e) mice, 
showing axons from proximal CA1 PHA-L injection in red, and 
GFP-Cre in subiculum in green. Three 60-j1m sections are arranged from 
medial to lateral, 480-600 |1m between sections. Scale bars, 200 um. 

c, f, Heatmaps showing normalized PHA-L fluorescence intensity 


neurons sent axons to distal subiculum as expected (Fig. 3b, d). In 
Ten3!"f mice, however, proximal CA1 axons covered most of subicu- 
lum, with their intensity peaking near the centre (Fig. 3c, d). The width 
of the projection was significantly increased in Ten3™", and the mean 
position shifted proximally (Fig. 3d—f). Thus, Ten3 is required in CA1 
neurons for their precise axon targeting in subiculum. 


Subiculum conditional knockout analysis 
To test whether Ten3 is also required in subicular cells for CA1 axon 
targeting, we injected cre-expressing lentivirus into subiculum at PO, 
the anterograde tracer PHA-L into proximal CA] at P35, and analysed 
the position of CA1 axons in subiculum (Fig. 4a). Because Cre expres- 
sion covered a small region of subiculum relative to the target field of 
labelled CA1 axons, we asked whether CA1 axons targeted differently 
within Cre-expressing regions versus nearby Cre-negative regions. In 
Ten3? controls, CA1 axons were unaffected when projecting into Cre- 
expressing subicular areas, as analysed in serial histological sections 
(Fig. 4b, c and Extended Data Fig. 8a, c). In Ten3!! mice, however, 
labelled CA1 axons appeared to avoid the patch of Cre-expressing, 
Ten3-deleted subicular cells (Fig. 4e, f and Extended Data Fig. 8b, d). 
To visualize the relationship between the CAl—subiculum axon 
projections and GFP-Cre expression in the target, we plotted the 
intensities of CA1 axons and GFP-Cre on the same graph as height 
and colour, respectively. While Cre expression (‘hot’ coloured) did not 
affect the projection strength (height) in Ten3'” controls (Fig. 4d, h and 
Extended Data Fig. 8c), the projection strength was severely diminished 
in Cre-expressing patches in Ten3™" mice (Fig. 4g, iand Extended Data 
Fig. 8d). Because we only made a focal deletion of Ten3 in subiculum 
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(red, left) and normalized GFP-Cre intensity (green, right) in subiculum, 
same mice as b and e. Each row is one section, 120 1m between rows. 

d, g, Surface plots showing normalized PHA-L intensity as height, and 
normalized GFP-Cre intensity as colour, same data as b, c and e, f, 
respectively. h, i, Surface plots from additional Ten3"(h) and Ten3!! (i) 
mice. See Extended Data Fig. 8 for all mice analysed. 


with the lentivirus injection, there were many Ten3-expressing cells in 
nearby regions of distal subiculum. The simplest interpretation for this 
drastic phenotype is that Ten3-high proximal CA1 axons destined for 
the Ten3-deleted patch mistargeted to nearby Ten3-high distal regions, 
consistent with Ten3 in distal subiculum acting as an attractant for 
proximal CA1 axons (Fig. 5f). 


Ten3 promotes homophilic adhesion 
If Ten3 functions as a homophilic cell adhesion molecule as previ- 
ously proposed”®>43, it could directly mediate the recognition of 
Ten3-expressing target cells by Ten3-expressing axons. Ten3 has 
alternatively spliced isoforms that differ by the inclusion or exclusion 
of exon 12 and exon 20 (at splice sites A and B, hereafter) encoding 
9- and 7-amino-acid peptides in the extracellular epidermal growth 
factor-like (EGF-like) and NCL-1, HT2A, and Lin-41 (NHL) repeats, 
respectively**5 (Fig. 5a, b). We denote the splicing at site A as Ay (exon 
12 excluded) or A; (exon 12 included), and at site B as Bo (exon 20 
excluded) or B; (exon 20 included). We sequenced Ten3 cDNA clones 
from CA1 or subiculum at P8, a time when the CA1l—subiculum 
projection was still developing (Extended Data Fig. 7). We found two 
additional alternative splicing variants at site A (termed Az and A3) that 
were computationally predicted*®; the inclusion of these exons (12.1 
or 12.2) would add six or seven additional amino acids to exon-12- 
containing variants (Fig. 5a, b). Overall, CA1 and subiculum expressed 
similar splicing isoforms, most of which included inserts at sites A 
and B (Fig. 5c). 

We next performed cell aggregation experiments, and found that all 
isoforms tested promoted homophilic adhesion except ApBo, which 
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Figure 5 | Ten3 promotes homophilic adhesion in a splicing isoform- 
dependent manner. a, Partial Ten3 genomic regions (top) highlighting 
alternatively spliced exons (grey boxes) and constitutive exons (white 
boxes). Splice variant names are next to corresponding splicing pattern. 

b, Locations of alternative splicing sites A and B in Ten3 protein, and 
amino acid sequences produced. YD, tyrosine and aspartate. Scale bar, 

200 amino acids. c, Summary of cDNA sequencing from P8 subiculum 
(n=52 clones) and CA1 (n=49 clones). d, Aggregation of K562 cells 
expressing different Ten3 isoforms and GFP. Scale bar, 200 1m. Bottom 
right: A,B, aggregate stained for the N-terminal haemagglutinin (HA) 

tag (red). Arrowheads highlight membrane-localized Ten3 at cell-cell 
junctions. Scale bar, 20j1m. e, Quantification of aggregate sizes pooled from 
three biological replicates. Dotted red line shows size cutoff at 600 jum”. NS, 
not significant; ****P < 0.0001, adjusted P values from Dunn’s multiple 
comparisons test after Kruskal-Wallis test, comparing all conditions to 
GFP. GFP, n= 1449 particles above threshold; ApBo, n= 26; AoBy, n= 1179; 
A,Bo; n=4l11; A3Bi, n= 1268; A>Bi, n=628; A,B, n= 336. f, Summary of 
data (top) and working model (bottom). KO, knockout. See Discussion 

for details. 


lacks all alternatively spliced exons (Fig. 5d, e). ApBo was functionally 
expressed because it promoted adhesion with latrophilin-3-expressing 
cells (Extended Data Fig. 9), as previously reported for other teneurin 
and latrophilin family proteins*”. The AoBo isoform occurred in only 
one out of 101 sequenced clones, indicating that most Ten3 isoforms 
present in CA1 and subiculum were capable of interacting in trans 
to promote homophilic adhesion. When the three highest-expressed 
isoforms of Ten3 were tested in a mixed cell aggregation assay, all 
aggregates were mixed, indicating that the major Ten3 isoforms 
expressed in the hippocampus can interact with each other in trans 
(Extended Data Fig. 10). Together with the matching expression and 
conditional knockout phenotypes, these data suggest that homophilic 
Ten3 interactions between CA1 axons and subicular cell bodies and 
dendrites control the targeting specificity of proximal CA1 axons to 
distal subiculum. 


Discussion 

A striking feature of neural development is the formation of highly 
precise connections between neurons. Sensory and motor circuits 
have been extensively used to characterize the molecular control 
of wiring specificity!!!*4748, but relatively little is known about how 
neurons in complex high-order circuits find appropriate partners. Here, 
we have shown that Ten3 acts in both pre- and postsynaptic neurons 


ARTICLE 


in the hippocampus to control the assembly of a precise topographic 
projection. Loss-of-function phenotypes support a homophilic attrac- 
tion mechanism: when Ten3 is lost from CA1 neurons, proximal CA1 
axons spread throughout the entire subiculum, instead of projecting 
only to distal, Ten3-high targets; when Ten3 is lost from a subset of 
distal subicular cells, Ten3-high proximal CA1 axons do not target these 
areas and instead innervate nearby Ten3-high regions (Fig. 5f). Our 
in vitro data further show that Ten3 can interact homophilically in 
trans, supporting a model in which Ten3 on CAI axons interacts with 
Ten3 on subicular targets, leading to contact-mediated attraction or 
stabilization of proximal CA1 axons by distal subicular target cells. This 
mechanism of action resembles that of the Drosophila teneurins in the 
development of olfactory and neuromuscular connections, suggesting 
an evolutionarily conserved mode of teneurin function in neural cir- 
cuit assembly from insects to mammals. However, whereas Drosophila 
teneurins instruct matching of discrete types of pre- and postsynap- 
tic cell, the graded expression in both CA1 and subiculum suggests 
that mouse Ten3 directs continuous topographic mapping along the 
proximal-distal axis (Fig. 5f). 

Our model does not exclude the possibility that interactions of Ten3 
with heterophilic partners have additional roles in circuit assembly. 
While in many cases we observed Ten3 expression in both pre- 
and postsynaptic partners of specific connections, there were also 
cases where Ten3 was only observed in axons but not targets (for 
example, MEC—dentate gyrus—CA3). Further, the AgBy isoform did not 
exhibit homophilic interactions but did interact heterophilically with 
latrophilin-3. These observations suggest that interactions between 
Ten3 and latrophilins or other potential heterophilic partners may also 
contribute to wiring specificity. 

Our results highlight small regions of Ten3 that are critical for trans 
interactions. Splice site A corresponds to the most C-terminal of the 
eight EGF-like repeats, which are thought to mediate cis-dimeriza- 
tion of teneurin proteins**’. Our results suggest that the EGF-like 
repeats may also participate in trans interactions, or that teneurin cis 
interactions may influence trans interactions. Splice site B is within 
the NHL-repeat region, which was implicated in homophilic teneurin 
interactions using single-cell force spectroscopy~’. Our result supports 
the importance of the NHL repeats, and identifies specific residues that 
are required for homophilic interactions. 

What controls the distal CA1—proximal subiculum projection? 
Since none of the other three mouse teneurins exhibited differential 
expression along the proximal-distal axis of the hippocampal for- 
mation’®, other differentially expressed proteins might act in parallel 
with Ten3 to control the distal CAl—proximal subiculum projection. 
Axon-axon competition could also contribute, as in retinotopic map 
development*”. Indeed, the enhanced severity of the CA1-specific 
conditional knockout phenotype compared with the whole-animal 
Ten3*° phenotype (Fig. 3d-f versus Fig. 2e-g) may result from Ten3- 
expressing CA1 axons out-competing mutant axons for space in distal 
subiculum, supporting a role of axon-axon competition in determining 
CA1—subiculum targeting specificity. 

Finally, our findings reveal genetic heterogeneity within many areas 
of the hippocampal region. Although our genetic analyses focused 
on the CA1—subiculum projection, Ten3-high to Ten3-high connec- 
tivity was also observed in the entorhinal—hippocampal projections 
(Fig. 1d-f), and probably exists in additional hippocampus-associated 
projections (Extended Data Fig. 3). The matching expression of Ten3 
in multiple topographically connected subregions, combined with our 
loss-of-function and in vitro data, suggests that Ten3 may control the 
assembly of a widely distributed circuit in mammalian brains. 

Online Content Methods, along with any additional Extended Data display items and 


Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Statistics and reproducibility. All statistical analyses were performed in Prism 
7 (GraphPad). No statistical methods were used to determine sample size. The 
experiments were not randomized. Experimenters were blind to genotype during 
injections and analysis of axon tracing, but not for electrophysiology or other 
experiments. Images of Ten3 immunostaining and in situ hybridization (Figs la, b 
and 2a and Extended Data Figs 1b-j, 3a—g, 4b-d and 6b-d) are representative of 
at least three separate experiments in all cases. Images of entorhinal cortex axon 
tracing (Fig. 1d, e) are representative of three injections. For developmental tracing, 
images shown are representative of three (P2 and P4) or two (P6 and P8) exper- 
iments. For subiculum conditional knockout (Fig. 4 and Extended Data Fig. 8), 
results from all experiments are shown in Extended Data Fig. 8. 

Mice. All animal procedures followed animal care guidelines approved by Stanford 
University’s Administrative Panel on Laboratory Animal Care. Both male and 
female mice were used, and mice were group housed. CD-1 mice from Charles 
River Laboratories were used for all wild-type expression studies. Ten3“4 mice”* 
were provided by M. Sur and maintained on a CD-1 background. Ten3 and Ten3!" 
were maintained on mixed CD-1, C57BL/6, and 129 backgrounds. Ten3* mice 
used in this study still had the neomycin resistance cassette (Neo) in the genome. 
Ten3" mice were crossed to a germline-active GFP-FlpO line*! to delete the Neo. 
Both Neo-deleted and Neo-containing Ten3" mice were used for Figs 3 and 4. 
Generation of Ten3 and Ten3" mice. Mice were generated by the Gene Targeting 
and Transgenics core at Janelia Research Campus. Ten3’ was generated by homol- 
ogous recombination in embryonic stem cells using standard procedures. Ten3! 
was generated by co-injection of the targeting construct with Cas9 mRNA and 
guide RNAs targeted just outside the loxP sites. Target sequences for gRNAs 
were GGGTGTCTAGAAAGTCAGTG AGG and AAAGTCCTTCATGGGCT 
TGC AGG. Targeting was verified in embryonic stem cells by long-arm PCR. 
After microinjection, chimaeras were bred with CD-1 females and F, offspring 
were screened by long-arm PCR and Southern blot to identify mice with germline 
transmission of the correctly targeted construct. 

Generation and affinity purification of Ten3 antibodies. Rabbit polyclonal anti- 
bodies (produced by YenZym Antibodies) were raised to the following peptides: 
Ten3IC (used in main figures), amino acids 163-176 (C)ENRSDSESEQPSNN; 
Ten3EC, amino acids 346-364 (C)DTFENGKVNSDTVPTNTVS. Affinity purifi- 
cation from serum was performed using target peptides immobilized on SulfoLink 
columns (Thermo Fisher, 44999). Peptides were coupled to columns following the 
manufacturer’s instructions. After blocking and washing, 2 ml antiserum mixed 
with 2 ml PBS, pH 7.4, was added to the column, and left on a nutator overnight 
at 4°C. The next day, columns were washed with 40 ml PBS pH 7.4, 40 ml PBS 
pH 7.4 + 0.5M NaCl, and 40 ml PBS pH 7.4, all by gravity flow. Columns were 
eluted with 20 ml 100 mM glycine pH 2.5. Fractions (1 ml) were collected into 
tubes containing 60,11 1 M Tris-HCl pH 8.8 for neutralization. All washes and 
elutions were performed at room temperature. Antibody concentration was meas- 
ured using A2g9 nm on a NanoDrop 1000, immunoglobulin-G setting. Fractions 
containing protein were combined and dialysed against PBS, pH 7.4, using Slide- 
a-lyzer cassettes (Thermo Fisher, 66380). Antibody concentration was measured 
again using A2g0 nm, and BSA (Thermo Fisher, 15561020) was added to raise total 
protein concentration to 1 mg ml”. Aliquots were snap frozen in liquid nitrogen 
and stored at —80°C. 

Immunostaining. Mice were given an overdose of 2.5% Avertin and were per- 
fused transcardially with PBS followed by 4% paraformaldehyde (by weight) in 
PBS. For Ten3 immunostaining, brains were dissected, post-fixed in 4% para- 
formaldehyde for 1h, and cryoprotected for 24-48 h in 30% sucrose. For all other 
immunostaining, brains were dissected, post-fixed in 4% paraformaldehyde for 
12-24h, and placed in 30% sucrose for 24-48 h. They were then embedded in 
Optimum Cutting Temperature (OCT, Tissue-Tek) on dry ice and stored at —80°C 
until sectioning. Floating sections (601m) were collected into PBS. Sections were 
incubated in the following solutions at room temperature unless otherwise indi- 
cated: 1 x 5-10 min in PBS, 1h in 0.3% PBS/Triton X-100 (PBST) and 10% nor- 
mal donkey serum (NDS), two to four nights in primary antibody at 4°C in 5% 
NDS in PBST, 2 x 15 min in PBST, 2h in secondary antibody in 5% NDS in PBST, 
2 x 15min in PBST, 30 min in DAPI (1:10,000 of 5 mg ml}, Sigma-Aldrich) in 
PBS, and 5-10 min in PBS. Sections were mounted on Superfrost Plus slides and 
coverslipped with Fluoromount-G (SouthernBiotech). For adult Ten3 staining, 
tissue was fixed overnight. Antigen retrieval on sections was performed in 10mM 
sodium citrate, pH 8.5, at 80°C for 30 min. Tissue was allowed to cool, washed three 
times with PBS, then blocked and immunostained as above. Primary antibodies 
used were rabbit anti-Ten3IC (1:1,000, four nights in primary antibody); rabbit 
anti-Ten3EC (1:200, four nights in primary antibody), mouse anti-Cre (1:1,000, 
Millipore, MAB3120), rabbit anti-PHA-L (1:1,000, Vector Labs, AS-2300, 
two nights in primary), chicken anti-GFP (1:2,500, Aves Labs, GFP-1020), and 
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rat anti-mCherry (1:1,000, Thermo Fisher, M11217). Secondary antibodies con- 
jugated to Alexa 488, Alexa 647, or Cy3 (Jackson ImmunoResearch) were used 
at 1:500 from 50% glycerol stocks. For Biotinylated Dextran Amine (BDA) axon 
tracing, sections were washed for 5 min in PBS, incubated in PBST for 30 min, 1:200 
streptavidin-Cy3 in PBST overnight, washed 3 x 15 min in PBST, and stained with 
DAPI and mounted as described above. Sections were imaged on Zeiss epifluores- 
cence and 780 confocal microscopes and on a Leica Ariol automated fluorescence 
slide scanning system. Images were processed in FIJI to stitch multiple fields- 
of-view of single sections and to adjust contrast and brightness of each channel. 
Images presented together were processed identically. 

In situ hybridization. In situ hybridization was performed as previously 
described* with the following modifications. Cryosections (12-16 jm) 
were collected on Superfrost Plus slides. Ten3 probe containing base pairs 
4124-4953 of Ten3 mRNA (located in exon 23; mRNA accession number 
NM_011857.3) was PCR amplified from genomic DNA using the follow- 
ing primers: 5’-GTGGCTAAAAGCCCACTGTTGCC-3’, 5’/-GAATGGCC 
CACTGACCTCGCG-3’. PCR product was cloned into pCR4-TOPO (K457502). 
RNA probes were transcribed using T3 or T7 RNA polymerases. For colorimetric 
development (Fig. 1d), Sigma-FAST NBT/BCIP (Sigma, B5655) was used. For 
combined in situ hybridization with immunostaining (Extended Data Fig. 1)) 
hybridized slides were incubated overnight with alkaline phosphatase-conjugated 
anti-DIG antibody (1:1,000, Roche Applied Science, 1093274) and rabbit anti- 
PCP4 (1:200, Sigma, HPA005792). Signals were developed using Fast Red TR/ 
Naphthol As-MX (Sigma-Aldrich, F4523) and 488 donkey anti-rabbit secondary 
(1:200, Jackson Immunoresearch). Sections were imaged on Zeiss epifluorescence 
and 780 confocal microscopes. For fluorescent in situ hybridization for quantifi- 
cation, 12,1m sections of P10 brain were used. Sagittal sections were taken from 
dorsal hippocampus, and horizontal sections were from middle levels of the dorsal- 
ventral axis. After hybridization, washing, and blocking, slides were incubated 
with alkaline phosphatase-conjugated anti-DIG antibody (1:1,000) for 1h and 
developed with Fast Red TR/Naphthol As-MxX for 45 min. Sections were imaged at 
10x magnification on a Zeiss epifluorescence scope. Fluorescence intensity meas- 
urements were taken on unprocessed images in FIJI and data were processed using 
custom MATLAB scripts. For CA1 quantifications, a 45-pixel-wide segmented line 
with spline fit was drawn along CA1, from proximal to distal. Intensity values along 
the CA1 line were measured using the Plot Profile command. For quantification of 
subiculum, a 125-pixel-wide segmented line with spline fit was drawn along subic- 
ulum, from proximal to distal. The bottom of the line was aligned to the superficial 
border of the subicular pyramidal cell layer; the proximal border of subiculum was 
defined in the same way as the distal border of CA1: that is, the point where the 
compact CA1 cell body layer ended. The distal subiculum border was defined by 
the increase in cell density of presubiculum layer 2. Because of the narrowing of 
the distal subicular cell layer, the line was drawn until the Ten3 fluorescent signal 
became narrower than the 125-pixel line, to avoid averaging areas with zero signal. 
This resulted in the exclusion of a small portion of the Ten3 signal in the most 
distal subiculum. Identical background subtraction was performed on all inten- 
sity traces, on the basis of the signal in sense control slides. The intensity traces 
were binned into 100 bins from proximal to distal, and the traces for individual 
sections were normalized to a peak of 100. Traces for individual sections were then 
averaged to produce intensity plots. Numbers of sections were as follows: sagittal 
and horizontal CA1, 12 sections, 4 mice; sagittal and horizontal subiculum, 
14 sections, 4 mice. 

Electrophysiology. At P14—P19, the brain was removed and placed in ice-cold car- 
bogenated slicing artificial cerebrospinal fluid that contained (in mM) 83 NaCl, 2.5 
KCl, 1 NaH2PO,, 26.2 NaHCO3;, 22 glucose, 72 sucrose, 0.5 CaCl;, and 3.3 MgSOx. 
Coronal sections (300 1m) were cut on a Leica vibratome. Slices were allowed to 
recover at 31°C for 40 min and then at 23-25 °C for 30 min to 6h. Slices were then 
placed in carbogenated recording artificial cerebrospinal fluid (119 NaCl, 2.5 KCl, 
26 NaHCOs, 1 NaH2POg, 1.5 MgSOug, 2.5 CaCh, and 11 glucose, all in mM) that 
contained 501M picrotoxin (Sigma). Signals were recorded with a 5 x gain, low- 
pass filtered at 2 kHz, digitized at 10 kHz (Molecular Devices Multiclamp 700B), 
and analysed with pClamp 10 (Molecular Devices). Whole-cell recordings were 
made using 3-5 MQ pipettes filled with an internal solution that contained (inmM) 
123 caesium gluconate, 8 NaCl, 1 CaCl, 10 EGTA, 10 HEPES, and 10 glucose, 
pH 7.3, with CsOH, 280-290 mOsm. Series resistance (R,) and input resistance 
(Rin) were monitored throughout the experiment by measuring the capacitive tran- 
sient and steady-state deflection, respectively, in response to a —5 mV test pulse. Rg 
and Ri, did not differ across experimental conditions (proximal subiculum record- 
ings: R, wild type= 4.397 + 0.157 MQ, n= 12; Ten3 knockout = 4.865 + 0.417 MQ, 
n=11; P=0.289, Student's t-test; Ri, in wild type = 201.6 + 18.7 MQ, n= 12; 
Ten3 knockout = 253.1 + 28.64MQ, n=11; P=0.141, Student’s t-test; 
distal subiculum recordings: R, wild type = 4.409 + 0.114MQ, n= 14; Ten3 
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knockout = 4.561 + 0.213 MQ, n=9; P=0.502, Student’s t-test; Rin wild 
type = 221.8 + 29.57 MQ, n= 14; Ten3 knockout = 254.6 + 34.83 MQ, n= 9; 
P=0.486, Student's t-test). Responses were evoked by a platinum two-contact 
cluster electrode (FHC) placed in the stratum oriens—alveus above distal CA1, 
at the stimulation intensities listed in the figure. Cells were clamped at —-60 mV 
to measure GluA (AMPA-receptor)-mediated EPSC. Analysis was based on the 
average of ten sweeps. The inter-stimulus interval for paired pulse ratio measure- 
ments was 50 ms. 

Stereotactic surgery. For stereotactic injections in neonatal mice, hypothermia 
was used for anaesthesia. Coordinates were zeroed from lambda. For CA1, coor- 
dinates used were 1 mm lateral, 1.1 mm anterior, 0.85 mm ventral from skull. For 
subiculum, coordinates were 1.1 mm lateral, 0.3 mm anterior, 0.8 ventral from skull. 
For developmental axon tracing, BDA (Thermo Fisher D1956, 5% in PBS) was 
iontophoretically injected into CA1 using a Digital Midgard Precision Current 
Source (Stoelting, 51595), pulled glass micropipettes (World Precision Instruments, 
1B120F4) with tips broken to outer diameter of 10j1m, and current parameters 
5A, alternating 7-s on, 7-s off, for 2min. Mice were perfused between 2 and 
8 days post-injection for histology as above. For conditional knockout experiments, 
100 nl of lentivirus containing Ub-GFP-cre (4.6 x 10° to 6.8 x 10° infectious units 
per microlitre, Neuroscience Gene Vector and Virus core, Stanford University) was 
injected at 100 nl min~!. Subsequent injections of PHA-L or AAV8-hSyn1-FLEx- 
mGFP-2a-synaptophysin-mRuby*™ were performed at P35-P40. For whole-animal 
knockout studies, PHA-L was injected between P30 and P45. Mice were anaesthe- 
tized with ketamine-xylazine (Lloyd Laboratories) or isoflurane and immobilized 
in stereotactic apparatus (Kopf). Coordinates were proximal CA1 (from bregma) 
1.4mm lateral, 1.25-1.3 mm posterior, 1.1-1.2 mm ventral from brain surface; 
distal CA1 1.4mm lateral, 1.8-2 mm posterior, 1.1 mm ventral. For PHA-L ion- 
tophoresis, 2.5% (by weight) PHA-L (Vector Labs, L-1110) solution in PBS, pH 8, 
was iontophoretically injected using current parameters 2 1A, alternating 7-s on, 
7-s off, for 1 min with pipette tips broken to an outer diameter of 6-12,1m. For AAV 
iontophoresis, current parameters were 2 1A, 7-s on, 7-s off, for 2 min. Mice were 
perfused 5 (PHA-L) or 21 (AAV) days later and processed for PHA-L and Cre, or 
Cre, GFP, and mCherry immunostaining as described above. For entorhinal cortex 
axon tracing, CD-1 mice aged P40-P55 were used. AAV1-CMV-GFP (40-50 nl) 
was injected at 20 nl min“! into the coordinates (from bregma): medial entorhi- 
nal 2.8 mm lateral, 4.75 mm posterior, 2.75 mm ventral from brain surface; lateral 
entorhinal 4.6 mm lateral, 3.5mm posterior, 3 mm ventral from brain surface. Mice 
were perfused 2 weeks later. 

Imaging and data analysis for CA1 axon tracing. For whole-animal mutant trac- 
ing, every other 60-\1m parasagittal section was stained and analysed. Images of 
injections were taken at 5x magnification on a Zeiss epiFluorescence scope and 
projections were imaged at 10x. Mice with labelling too faint to image or with 
retrogradely labelled cells were excluded. Images were acquired at identical gain 
and offset settings, but because injection and projection intensities varied between 
mice, exposure times were adjusted to avoid saturation. Fluorescence intensity 
measurements were taken on unprocessed images in FIJI and data were processed 
using custom MATLAB scripts. For quantification of injections, a 30-pixel-wide 
segmented line with spline fit was drawn along CA1, from proximal to distal. DAPI 
and Cre signals (from Ten3‘”) were used to identify the proximal end of CA1, and 
DAPI signal was used to identify the distal end of CA1. Intensity values along 
the CA1 line were measured using the Plot Profile command. For quantification 
of projections, a 180-pixel-wide segmented line with spline fit was drawn along 
the subiculum, from proximal to distal. The bottom of the line was aligned to the 
superficial border of subiculum pyramidal cell layer; the proximal border of subic- 
ulum was defined in the same way as the distal border of CA1: that is, the point 
where the compact CA1 cell body layer ended. The distal border of the subiculum 
was defined as the point where layer 2 of the presubiculum became obvious as an 
increase in DAPI density in the superficial cell body layer. The 180-pixel-wide 
curved line was straightened using the Straighten function under Selection. For 
injections that labelled CA2 and CA1, axons from CA2 were present near the distal 
border of CA1. Area selections were drawn around these axons and intensities were 
set to zero using the Clear function. Pixel values at each position were imported 
into MATLAB, and the 180 pixels for each point along the proximal-distal axis 
were averaged to give the fluorescence intensity at that coordinate of the proxi- 
mal-distal axis. From this point forward, injections and projections were processed 
in the same way. A manual background subtraction was performed by selecting a 
value corresponding to an area of the trace that was only background signal, and 
this value was subtracted from all points on the trace. Any intensity values below 
zero after subtraction were set to 0. The length of the CA1 or subiculum axis 
was divided into 100 equally sized bins and intensity values were averaged within 
each bin. For each injection or projection, the three sections with the highest total 
labelling were combined by summing the three intensity values at each binned 
position. This summed intensity trace was then normalized to a peak value of 100. 


The mean position of the trace was calculated by multiplying the intensity value 
by the bin position, summing across the entire axis, and dividing by the sum of 
the intensity values. The width of the trace was defined as the full width-at-half- 
maximum. Traces were averaged across mice to produce the plots in Fig. 2e and 
Extended Data Fig. 5c, d, and the shaded error traces represent the standard error 
of the mean fluorescence intensity at that bin. For mean and width quantifications 
in Fig. 2f, g, the means were compared using two-tailed t-tests. For quantification 
of tracing across the proximal-distal axis of CA1 (Extended Data Fig. 5), injections 
and projections were processed as above. Values were combined into five groups 
for analysis and averaging based on the injection mean positions to the right of 
Extended Data Fig. 5d. Projection widths were compared using two-way analysis 
of variance (ANOVA) followed by a Sidék multiple comparisons test; multiplicity- 
adjusted P values are reported. Projection mean regression lines were generated 
and compared using Prism 7 (GraphPad). All statistical analyses were performed 
in Prism 7 (GraphPad). 

Data analysis for conditional knockout experiments. Tissue processing and 
imaging for CA] and subiculum conditional knockout experiments were the same 
as for whole-animal mutants, with the following modifications. For subiculum 
conditional knockout, mice were excluded if lentivirus-GFP-Cre cells spread into 
the proximal half of CA1 (more than two cells in proximal CA1 per section, or 
if any PHA-L* cells were GEP—Cre*). Some mice had GFP-Cre expression in 
the distal CA1; however, because these cells express low to no Ten3, these mice 
were included as subiculum conditional knockouts. GFP-Cre signal in subiculum 
was measured and quantified along with the PHA-L labelling intensity. Heatmaps 
were generated in MATLAB using the ‘imagesc’ function, and three-dimensional 
plots were generated in MATLAB using the ‘surf’ function. For CA1 conditional 
knockout, mice were excluded if lentivirus-GFP-Cre cells spread into the dis- 
tal half of subiculum. The mRuby signal from AAV8-hSyn1-FLEx-mGFP-2a- 
synaptophysin-mRuby was used for injection quantifications and the mGFP for 
projection quantifications. In addition, we stained for Cre in the 647 nm channel 
instead of using the GFP-Cre signal, because of the presence of mGFP. mGFP 
and GFP-Cre signal could be unambiguously distinguished by staining for 
Cre. In cases where the GFP-Cre signal overlapped with the mGFP signal, the Cre 
staining in the 647 nm channel was used to mask out Cre* pixels, so the GFP-Cre 
signal was not counted as mGFP from CA1 axons. Analysis and quantification 
for CA1 conditional knockout projection mean and width was identical to the 
whole-animal knockout. For both CA1 and subiculum conditional knockouts, 
mice were excluded that had injection mean positions greater than 30 in CA1. 
Hippocampus cDNA sequencing. Freshly dissected brains from P8 CD-1 mice 
were sliced horizontally into 1 mm slices using a tissue slicer. CA1 and subiculum 
were microdissected from individual slices using tungsten needles. Tissue from 
five mice was pooled, and total RNA was extracted using TRIzol reagent (Thermo 
Fisher, 15596018) and chloroform followed by purification using an RNeasy kit 
(Qiagen). DNA was removed by on-column digestion with DNase (Qiagen) for 
15min. cDNA was produced using a Superscript III First-Strand Synthesis System 
(Thermo Fisher, 18080-051) and a gene-specific primer located 3’ of splice site B 
(5'‘-ATCAGAGACGTACAAATCTCCGG-3’) followed by PCR for 30 cycles with 
Q5 polymerase (NEB) using the same primer and a primer located 5’ of splice site A: 
5'-TGTAATCAGAGAGCTTGCCACC-3’. PCR products were cloned into pCR- 
Blunt II-TOPO (Thermo Fisher, 450245), individual colonies were miniprepped 
and sequenced using T7 and SP6 sequencing primers (Sequetech), and the status at 
splice sites A and B was assigned to individual clones using Geneious 10 software. 
DNA constructs. Full-length mouse Ten3 cDNA was obtained from Source 
Bioscience, clone identifier IRCKp5014M2020Q (accession number BC145284.1). 
This sequence lacks all alternatively spliced exons (corresponding to isoform AgBp). 
Haemagglutinin-tag was added and cDNA was cloned into pCDNA3.1(—) 
(Thermo Fisher, V79520) in two steps using a NEBuilder HiFi Assembly Kit (NEB, 
E2621) and NEB Stable competent cells (NEB, C3040H). Phusion Taq polymerase 
(NEB M0530) was used to amplify the 5’ portion for Ten3 cDNA, adding a 
5' haemagglutinin-tag and Kozak sequence. Primers were as follows: 5/-TAGACT 
CGAGCGGCCGCCACCATGTACCCATACGATGTTCCAGAT TACGCTGATG 
TGAAGGAACGCAGG-3’ and 5’-TGGAATTCTGCAGATATCCTGAGGAT 
CAGGCAA-3’. NEBuilder HiFi Assembly mix was used to insert this PCR frag- 
ment into pCDNA3.1(-) digested with EcoRV and NotI. The assembled 
product was subsequently digested with EcoRV and KpnI, and the HiFi Assembly 
mix was used to insert the 3’ end of Ten3 cDNA, amplified using the following 
primers: 5'’-CCTGATCCTCAGGATATCATTAG-3’ and 5‘-TTAAACTTAAGCT 
TGGTACCTTACCTCTTGCCGATCTCA-3’. The assembled product was fully 
sequenced. Additional isoforms were generated using a Q5 Site-Directed 
Mutagenesis Kit (NEB E0554S). AB; was generated from AoBy using the following 
primers: 5’‘-TTAGACATAGTAGCAACCCAGCTCACAG-3’ and 5/-AATCTTT 
ATTTCTTAGTTCTAAAACACTTGTCACATTC-3’. A;Bo and A,B, were 
generated from AoBo and AoBy, respectively, using the following primers: 
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5'/-AAGATAGTTAAAGAGGGTTGCCCCGGCTTGT-3’ and 5’-ATCCAA 
ATAGTGAGCAATTGTGCAGTGCTCTCCGTTC-3’. AB; was generated from 
A,B, using the following primers: 5‘-GATATAAAGAGGGTTGCCCCGGCTTGT 3’ 
and 5’ CTATCTTGTCTTTAACTATCTTATCCAAATAGTGAGCAAT 
TGTGCAG-3’. A3B; was generated from A,B, using the following primers: 
5’-GGATATAAAGAGGGT TGCCCCGGCTTGT-3’ and 5’-TATCTTGTCTG 
CTTTAACTATCTTATCCAAATAGTGAGCAATTGTGCAG-3’, 

All the haemagglutinin-tagged Ten3-isoform open reading frames were then 

cloned into a dual transcription unit vector under the Efla promoter. The vector 
also contained mCherry or GFP under the CMV promoter. Primers for ampli- 
fying the haemagglutinin-tagged open reading frames were 5’-TACAAAAAA 
GCAGGCTGCCACCGTCGACATGTACCCATACGATGTTC-3’ and 5’-TTGTA 
CAAGAAAGCTGGGTGGATCCTTACCTCTTGCCGATC-3’. NEBuilder 
HiFi Assembly mix was used to insert this PCR fragment into the destination 
vector digested with BamHI and Sall. CMV-mCherry and CMV-GFP were 
generated by cloning eGFP and mCherry into Nhel and HindIII-digested 
pCDNA3.1(—) using a NEBuilder HiFi Assembly Kit. eGFP and mCherry were 
amplified from source plasmids using the following primers: 5‘-GGAGACCCA 
AGCTGGCTAGCCACCATGGTGAGCAAGGGCGA~-3’ and 5’-ATCAGCGG 
TTTAAACTTAAGCTTTTACTTGTACAGCTCGTCCA-3’. The expression 
construct for human latrophilin-3 was published previously™! and provided by 
R. Sando and T. Siidhof. 
Single isoform K562 cell aggregation assay. K562 cells (American Type Culture 
Collection (ATCC) CCL-243, lot 61978333) were grown in RPMI-1640 (Gibco 
11875-093 with 10% FBS (Gibco A3160501) and 1x penicillin-streptomycin 
(Gibco 15140122). The lot was authenticated and tested for mycoplasma by ATCC. 
Cells were electroporated using a Neon Transfection system (Thermo Fisher 
MPK5000) with 100-11 tips. DNA was prepared using a HiPure Plasmid Filter 
MaxiPrep Kit (Thermo Fisher K210017). Fifteen micrograms of the Efla-Ten3 
CMV-GFP dual transcription unit plasmid expressing a particular isoform were 
used for each electroporation. For the GFP-only condition, 6.43 1g of Efla-empty 
CMV-GFP along with 8.57 1g of empty pCDNA3.1 were used. Two million cells 
were washed once in 5 ml PBS, resuspended in 110 1l buffer R, mixed with DNA, 
and electroporated using the following settings: 1,450 V, 10 ms, three pulses, and 
added to 5 ml of warm RPMI-1640 with 10% FBS. Cells were incubated in six-well 
plates at 37°C and 5% CO3. Sixteen to 20h later, cells were centrifuged at 200g for 
3 min, resuspended in 1 ml RPMI-1640 + 10% FBS, and incubated for 15 min 
at 37°C with 0.1 mg ml“! DNase (Worthington LS002060). Cells were washed 
twice with RPMI-1640 + 10% FBS, then resuspended in aggregation medium. 
Aggregation medium was neural growth medium: Neurobasal-A, 4% B27, 2mM 
glutamine, 10% FBS, 20mM HEPES. Cells were passed through a 40-j.m 
cell strainer, counted, and added to wells of a 24-well plate at 4 x 10° cells per 
millilitre in 1 ml aggregation medium, and the plate was placed on a nutator at 
37°C for 1.5h. After 1.5h, cells were gently transferred to 2 ml of aggregation 
media in a six-well plate and imaged with a Nikon Eclipse Ti microscope using a 
4x lens. Five images were taken for each well. Images were thresholded in FIJI, 
and Analyze Particles was used to measure particle number and size. Particle sizes 
from three separate experiments were combined, and particles below 600 um?, 
the size of a large GFP* cell in control wells, were filtered out to remove single 
cells. Combined particle sizes were then compared with GFP-only control using a 
Kruskal-Wallis test followed by Dunn's multiple comparisons test. A family-wide 
significance level was set to 0.05, and multiplicity-adjusted P values were reported. 
For haemagglutinin immunostaining following aggregation, cells were fixed in 
4% paraformaldehyde for 15 min, centrifuged for 3 min at 200g, rinsed once with 
PBS, and resuspended in 30% sucrose in PBS. Cells were spun onto Superfrost 
Plus slides using a cytospin® then incubated in the following solutions, at room 
temperature except for primary antibody: 2 x 5 min in PBS, 10 min in 0.1% Triton 
X-100 in PBS, 3 x 5min in PBS, 30 min in 10% NDS in PBS, overnight in primary 
antibody (Ms anti-haemagglutinin, Covance MMS-101R) at 4°C in 5% NDS in 
PBS, 3 x 5 min in PBS, 2h in secondary antibody (Donkey Alexa 647 anti-mouse, 
Jackson ImmunoResearch) in 5% NDS in PBS, 3 x 5 min in PBS, 5 min in DAPI, 
and 5 min in PBS. Aggregates were imaged at 40 x magnification on a Zeiss 780 
confocal microscope. 
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K562 mixed cell aggregation assay. For the Ten3 isoform mixing assay (Extended 
Data Fig. 10), electroporations and aggregation assay were performed as above 
with the following modifications. The mCherry versions of the dual promoter 
constructs were used where appropriate. The two populations of cells were mixed at 
2 x 10° each in 1 ml aggregation media, for a final cell concentration of 4 x 10° cells 
per millilitre. Phase-contrast images of the cells and aggregates were also taken. For 
quantification, aggregates were identified from the phase image, then identified as 
mixed, GFP-only, or mCherry-only, on the basis of the merged fluorescence image. 
Mixed aggregates had to contain at least three cells of each colour, to avoid identify- 
ing single population aggregates as mixed because of chance overlap with a single 
cell in the other channel. At least 100 aggregates across three separate experiments 
were counted. For the latrophilin-3 and Ten3 mixing experiment, conditions and 
analysis differed as described below. For each electroporation, 5 1g CMV-GFP 
DNA was co-electroporated with 12 jg CMV-Ten3 or 121g empty pCDNA3.1, or 
5 ug CMV-mCherry and 1.75 1g CMV-latrophilin-3. Aggregation media were the 
same as above except with 1.25% FBS rather than 10%. After 1.5h of mixing, 31 
aliquots were taken from the wells, spotted onto slides, and imaged at 5x magni- 
fication on a Zeiss epifluorescence scope. Three or four aliquots were taken per 
well, and two images were taken per aliquot. For aggregate size quantification, the 
GFP and mCherry channels were thresholded, combined, and Analyze Particles 
was used on the combined binary image to measure particle number and size. 
Particle sizes from three separate experiments were combined, and particles below 
a 600m”, the size of a large GFP* cell in control wells, were filtered out to remove 
single cells. Combined particle sizes were then compared with latrophilin-3 and 
GFP-only control using a Kruskal-Wallis test followed by Dunn's multiple compar- 
isons test. A family-wide significance level was set to 0.05, and multiplicity-adjusted 
P values were reported. 

Code availability. Custom MATLAB scripts used in analysing data are available 
from the corresponding author upon reasonable request. 

Data availability. All data supporting the findings reported in this study are avail- 
able from the corresponding author upon reasonable request. The Ten3 isoform 
cDNA sequences have been deposited in GenBank under accession numbers 
MG387139-MG387 146. 
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Extended Data Figure 1 | Ten3 staining details and controls. a, Diagram 
of Ten3 protein showing location of antibody epitopes, specific domains, 
and region deleted in the Ten3“* mutant. In Ten3“, a neomycin resistance 
cassette (Neo) replaces 110 base pairs of sequence directly N-terminal to 
the transmembrane domain. The Ten3IC antibody was used for all Ten3 
stainings in the paper except d and e of this figure. Scale bar, 200 amino acids. 
b, c, Ten3IC staining (red) on P9 horizontal sections of Ten3" (b) and 
mutant Ten34“/“4 (c) brains, showing loss of staining in Ten3 mutants. 

Note that the Ten3IC epitope is located N-terminal to A4, suggesting that 

in Ten3“““ either the mRNA undergoes nonsense-mediated decay or the 
truncated protein is not stable. d, e, Ten3EC staining on P10 horizontal 
sections of Ten3? (d) and mutant Ten34“/4 (e) brains. Staining has a 
higher background than Ten3IC antibody, but signal is present in proximal 
CAI (arrow) and distal subiculum (arrowheads), similar to Ten3IC, which 
is absent in the knockout (open arrow/arrowheads). f, Ten3 staining 


YD repeats 


200 a.a. 


Ten3* 


(red) on P10 horizontal section with boxes around regions magnified in 
g-i. g, Ten3 staining in dentate gyrus from f, CA3, and CA1. Intensity 

was increased to highlight Ten3 signal in axons and dendrites. Ten3 in 
stratum radiatum of proximal CA1 (arrowhead) is most probably from 
CA1 dendrites, since CA3 cells, the major source of axons in this layer, did 
not express Ten3 mRNA (Fig. 1b). Ten3 in the molecular layers of dentate 
gyrus and CA3 (arrows) is probably contributed by the axons of MEC layer 
II neurons®, since dentate gyrus and CA3 neurons did not express Ten3 
mRNA (Fig. 1b). h, Proximal CA1 pyramidal cell layer from f, showing 
Ten3 signal in cell bodies. i, Proximal CA1 stratum lacunosum-moleculare 
from f, showing Ten3 signal in the region where MEC axons synapse onto 
CAI pyramidal neuron dendrites. j, In situ hybridization on P9 horizontal 
section for Ten3 mRNA (magenta) combined with immunostaining for 
PCP4 (green), a marker of CA2 neurons’’. No overlap between Ten3 and 
PCP4 was observed. Scale bars, 200 1m in bg, j, and 100 1m in h, i. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


DAPI b 


Ten3 mRNA 


ARTICLE 


Subiculum 


ive} 
oO 


Normalized 
fluorescence intensity 


ie} 
oO 


Normalized 
fluorescence intensity 


0 20 
Proximal 


Extended Data Figure 2 | Distribution of Ten3 mRNA in sagittal 
sections. a, In situ hybridization for Ten3 mRNA on sagittal section 

of P10 brain. Top, merged image with Ten3 mRNA signal in red and 

DAPI in blue; middle, Ten3 mRNA signal alone; bottom: DAPI signal 
alone. b, Top: magnified image of Ten3 in situ hybridization in CA1; 
bottom: quantification of Ten3 mRNA along the proximal-distal axis of 
CA1 (n= 12 sections, four mice), showing a graded signal that peaks in 
proximal CA1 and decreases to a minimum in distal CA1. Proximal-distal 
axis is divided into 100 bins, with 1 being most proximal and 100 most 


40 
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distal. Shaded curves represent mean + s.e.m. c, Top: magnified image of 
Ten3 in situ hybridization in subiculum; bottom: quantification of Ten3 
mRNA along the proximal-distal axis of subiculum (n= 14 sections, 

four mice) showing a graded signal that peaks in distal subiculum and 
decreases to a minimum in proximal subiculum. The distributions in CA1 
and subiculum are similarly shaped but reversed along the proximal-distal 
axis, reflecting the graded topographic connections along this axis (see 
Fig. 2 and Extended Data Fig. 5c). Scale bars, 200j1m in all panels. 
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Extended Data Figure 3 | Ten3 expression and topography details. 

a, Ten3 staining (red) on P10 horizontal section. Dotted rectangles 
highlight staining in the hippocampal region and anteroventral thalamic 
nucleus, which are magnified in b and c. b, The parahippocampal region 
from a, showing expression of Ten3 relative to the proximal-distal (P-D) 
axes (arrows) in the presubiculum and parasubiculum. The connectivity of 
these regions is complex**?, but seems to be consistent with preferential 
connectivity between Ten3-expressing subregions. Ten3 is expressed 

in distal presubiculum (close to parasubiculum), which projects to 

MEC near the parasubicular border*’, and receives projections from 
distal subiculum®, both Ten3-high subregions. Ten3 is also expressed 

in proximal parasubiculum, which projects to MEC** and receives 
projections from distal subiculum®, again both Ten3-high subregions. 

c, The anteroventral thalamus from a, showing intense Ten3 staining 

in the anterior and lateral division of the anteroventral thalamic nucleus 
(AVT, outlined). Distal subiculum, another Ten3-high region, projects 
to the anteroventral thalamic nucleus, whereas proximal subiculum 
projects to the anteromedial thalamus®'. d, In situ hybridization for 

Ten3 mRNA (green) on P9 horizontal section from similar location as c. 


e, Ten3 staining (red) on P10 horizontal section, more ventral than a. 
Dotted rectangle highlights intense staining in the medial mammillary 
nucleus, which is magnified in f. f, Medial mammillary nucleus from e, 
showing Ten3 labelling in the lateral division of the medial mammillary 
nucleus (IMMn). Outlines show location of lateral mammillary nucleus 
(LM), lateral division of the medial mammillary nucleus, and medial 
division of the medial mammillary nucleus (mMMn). Proximal subicular 
neurons project to the medial division of the medial mammillary nucleus, 
whereas Ten3-high distal subicular neurons project to the Ten3-high 
lateral division*”®*-**, The neurons of the lateral division project to the 
Ten3-high anteroventral thalamic nucleus, while medial division neurons 
project to the anteromedial thalamic nucleus*’. g, In situ hybridization for 
Ten3 mRNA (green) on P9 horizontal section from similar location as f. 
Scale bars, 500 1m in a and e; 200 1m elsewhere. In summary, the pattern 
of Ten3-high to Ten3-high connectivity observed for CA1, subiculum, and 
entorhinal cortex seems to extend to many of the topographic projections 
formed between these subregions and the presubiculum, parasubiculum, 
thalamus, and mammillary nucleus. 
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Extended Data Figure 4 | Generation and characterization of Ten3. 

a, Design of Ten3*. Top: region of chromosome 8 containing Ten3 exon 1, 
which contains the start codon (ATG). Middle: targeting construct with 
cre open reading frame inserted directly after the Ten3 start codon. 

cre is followed by a synthetic intron, woodchuck hepatitis virus post- 
transcriptional regulatory element (WPRE), and bovine growth hormone 
polyadenylation sequence (bGH pA) (see key on right). Neo includes 

a phosphoglycerate kinase (PGK) promoter driving the resistance 

gene. Bottom: genomic region after homologous recombination. 


Tengen 


Cre Ten3 Tengcece Cre Ten3 
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The endogenous exon 1 sequence after the start codon is replaced with 
cre. Neo was not removed by flippase (FLP)-mediated recombination 

in the mice used in Fig. 2. b-d, Cre and Ten3 protein expression in P10 
horizontal sections from Ten3 mice. White dotted lines highlight 
proximal and distal borders of CA1 and subiculum. b, c, Cre expression 
(green) mimics the distribution of Ten3 expression (magenta) in Ten3"*! 
mice. d, In Ten3°/? mice, Ten3 immunostaining is absent. Scale bars, 
200 jum. 
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Extended Data Figure 5 | Analysis of CA1—subiculum projections 
with various injection sites in Ten3” control and Ten3*° mice. 

a, b, PHA-L (green) injection in distal CA1 (a) and corresponding 
projection in proximal subiculum (b) in Ten3" mouse. Scale bars, 
200 1m. c, d, Averaged normalized injection (c) and projection (d) 
traces of all Ten3* (black) and Ten3*° (red) mice analysed, binned 
into five groups by the mean position of the injection, and plotted from 
most proximal (top) to most distal (bottom) injections (bin limits and 
number of mice per bin listed on the right of d). Proximal-distal axis 
position is numbered from 1 (most proximal) to 100 (most distal). 
Shaded error curves represent mean = s.e.m. at each bin. e, Projection 
width in subiculum versus injection mean position in CA1 for all mice 


Mean injection position 


Mean injection position 
(% of CA1) 


(% of CA1) 


(Ten3': n= 31, black circles; Ten3*°: n = 38, red triangles). f, Projection 
width data binned by injection mean. Number of mice per bin same as 

d. Projection width was significantly increased in Ten3*° for the three 
most proximal bins. ****P < 0.0001; multiplicity-adjusted P values after 
two-way ANOVA with Sidak’s correction for multiple comparisons. Error 
bars, mean + s.e.m. g, Projection mean position in subiculum versus 
injection mean position in CA] for all mice used (Ten3"™!: n= 31; Ten3*°: 
n= 38), with superimposed linear regression lines (Ten3"*': R? = 0.9812; 
Ten3*°; R? =0.9515). The slopes were significantly different (P< 0.0001), 
indicating a less sharp topography in Ten3*° mice. Bin 1 data (most 
proximal, injection mean 10-25) in c-g are the same data as in 

Fig. 2e-g. 
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Extended Data Figure 6 | Generation and characterization of Ten3"*. FLP-mediated recombination in some of the mice used in Figs 3 and 4. In 
a, Design of Ten3". Top: region of chromosome 8 containing Ten3 exon 4, addition to deleting exon 4, the reading frame 3’ to exon 4 is frame-shifted 
which is 239 base pairs long and encodes 19 of the 21 amino acids in the with respect to the reading frame 5’ to exon 4. b-d, Cre (green) and Ten3 
transmembrane domain. Guide RNA (gRNA) targets shown in red (see (magenta) protein expression in P10 horizontal sections from Ten3fl+ 

key at right). Line 2: targeting construct with loxP sites inserted 5’ and 3’ of — (b), Ten3"* (c), and Ten3*" (d) mice. Ten3 staining is absent in Ten3°°! 
exon 4. Neo includes a PGK promoter driving the resistance gene. Line 3: mice. White dotted lines highlight proximal and distal borders of CA1 and 


genomic region after homology-directed repair. Bottom: deletion of exon 4 — subiculum. Scale bars, 200 jim. 
after Cre-mediated recombination between loxP sites. Neo was removed by 
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Extended Data Figure 7 | Time course of CA1—>subiculum projection shows a magnified image of the section containing the highest density 
development. Sagittal sections from mice injected with BDA (green) in of projection at subiculum. Dashed lines mark proximal and distal CA1 
CA1 at PO, and perfused for staining at P2 (a, b), P4 (c, d), P6 (e, f), or borders in the left panels, and proximal and distal subicular borders in the 
P8 (g, h). Two mice are shown for each time point with a pair of images right panels. CA1 axons are largely absent at subiculum at P2, and increase 
per mouse. Within each pair, the left image shows the section that intensity from P4 to P8. Scale bars, 200 jm. 


contains the centre of the injection site at CA1, whereas the right image 
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Extended Data Figure 8 | Subiculum conditional knockout plots. 
a, b, Plots from Ten3" (a) and Ten3""" (b) mice with minimal GFP-Cre 
expression. Heatmaps show normalized PHA-L fluorescence intensity 
(red, left) and normalized GFP-Cre intensity (green, middle) in 

subiculum, same mice. Each row is one section, 120m between rows, 
colour bars shown below a, and proximal-distal position is on the x axis. 
Surface plots are to the right of the corresponding heatmaps, showing 


L 


M 


AR 


ARTICLE 


GFP-Cro 


Ten3™ 
PHA-L 


vu 
is) 


| 


Tens” 


PHA-L GEP-Cre 


IAANERRAR HERR 
S>SCLELPEER FEEBED 


PHA-L fluorescence intensity as height and GFP-Cre fluorescence 
intensity according to the colour map shown below a. P, proximal; D, 
distal; M, medial; L, lateral. Projections are similar between Ten3"? and 
Ten3""", c, d, Plots from Ten3? (c) and Ten3 (d) mice with high 
GFP-Cre expression in subiculum. In Ten3!!! mice, PHA-L signal is 
decreased in GFP-Cre regions. Number of mice shown in Fig. 4. 
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Extended Data Figure 9 | Latrophilin-3 and Ten3 aggregation 

assay. a, Images from aggregation assay with cells co-transfected with 
latrophilin-3 (Lphn3) and mCherry (magenta) mixed with cells co- 
transfected with GFP and empty vector (left), ApBo isoform of Ten3 
(middle), or A,B; isoform of Ten3 (right). Scale bar, 200 1m, applies to all 
images. b, Quantification of aggregate sizes pooled from three biological 
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replicates. Dashed red line shows cutoff at 600 jum”, the size of a large 
GFP* cell from the control images. Asterisks denote significance from 
Dunn's multiple comparisons test after a Kruskal-Wallis test, comparing 
all conditions with the Lphn3 and control mix, Lphn3 + control: n = 32 
particles above threshold; Lphn3 + ApBo: n = 172; Lphn3 + A,By: n= 159. 
NS, not significant; ****P < 0.0001, multiplicity-adjusted P values. 
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combinations of K562 cells expressing the AiB;, A2B,, or A3B; Ten3 isoform. No mixed aggregates were observed in combinations of Ten3- 
isoform along with GFP or mCherry. Scale bar in bottom right panel, expressing cells with cells expressing mCherry alone, confirming that the 
200m, applies to all images. b, Quantification of aggregates observed aggregation is Ten3-dependent and not due to an endogenously expressed 
in three biological replicates of the aggregation experiment in a. At least interaction partner. Fractions in parentheses indicate aggregates of a 
100 aggregates were counted across the three replicates in each of the 10 particular type out of all aggregates counted in that condition. 


mixing conditions, except for the GFP-alone and mCherry-alone controls, 
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A parsec-scale optical jet from a massive young star 
in the Large Magellanic Cloud 


Anna F. McLeod!, Megan Reiter’, Rolf Kuiper’, Pamela D. Klaassen* & Christopher J. Evans* 


Highly collimated parsec-scale jets, which are generally linked to 
the presence of an accretion disk, are commonly observed in low- 
mass young stellar objects”. In the past two decades, a few of these 
jets have been directly (or indirectly) observed from higher-mass 
(larger than eight solar masses) young stellar objects*’, adding 
to the growing evidence that disk-mediated accretion also occurs 
in high-mass stars*", the formation mechanism of which is still 
poorly understood. Of the observed jets from massive young stars, 
none is in the optical regime (massive young stars are typically 
highly obscured by their natal material), and none is found outside 
of the Milky Way. Here we report observations of HH 1177, an 
optical ionized jet that originates from a massive young stellar 
object located in the Large Magellanic Cloud. The jet is highly 
collimated over its entire measured length of at least ten parsecs 
and has a bipolar geometry. The presence of a jet indicates ongoing, 
disk-mediated accretion and, together with the high degree of 
collimation, implies that this system is probably formed through a 
scaled-up version of the formation mechanism of low-mass stars. We 
conclude that the physics that govern jet launching and collimation 
is independent of stellar mass. 

Ata distance of 50 kpc, the Large Magellanic Cloud (LMC) is an ideal 
search location for high-mass (M > 8M.5, where Mog is the mass of the 
Sun) young stellar object (MYSO) jets. It is close enough to resolve 
MYSOs and their jets; it has a nearly face-on orientation, allowing a 
convenient viewing angle; it is undergoing active star formation; its 
low dust content makes it an environment in which the feedback 
effect of strong ionizing radiation is enhanced’ and radiation pressure 
suppressed; and it is a low-metallicity environment, allowing a resolved 
probe of star formation in conditions similar to those of earlier epochs 
of the Universe. Current MYSO models are based on Milky Way obser- 
vations, so the LMC allows the analysis of the effect of a different envi- 
ronment on the formation and propagation of jets from MYSOs. Until 
now, only a handful of outflows from massive stars in the LMC!3!4 
have been reported, all of which have been detected in the radio regime 
with the Atacama Large Millimeter/submillimeter Array (ALMA). The 
driving sources of these outflows are still deeply embedded in their 
natal molecular cloud cores and the measured outflows are on sub- 
parsec scales, with no optical jet counterparts. 

The MYSO Herbig—Haro (HH) flow! reported here was detected 
in observations of the ionized star-forming region LMC N180 made 
with the optical integral-field spectrograph Multi Unit Spectroscopic 
Explorer (MUSE)!° at the Very Large Telescope (VLT). LMC N180 
consists of a classical, bubble-shaped H 11 region (see Fig. 1) with a 
radius of about 2.5’ (approximately 36 pc). It is located on the border of 
a giant molecular cloud, it is currently undergoing star formation, and 
it hosts over 50 stars with masses M > 10M,-15Mo, as well as several 
early O-type stars’”, 

The jet itself is externally ionized and is detected as blue- and 
redshifted emission peaks of the Ha line. It spans a total of 10 pc and 
is emerging from a pillar-like structure protruding from the southern 


rim of the surrounding ionized star-forming region. The driving source 
of the jet is probably a MYSO of approximately 12M. (corresponding 
to the young stellar object N180-24 in ref. 17), which has formed at the 
tip of the pillar. 

The jet-driving MYSO is optically visible at the origin of the blue and 
red lobes; this is unusual, as jet-driving MYSOs are usually not visible in 
the optical wavelength regime owing to the high extinction caused by 
their embedding material. We suggest that the low dust content of the 
LMC has aided our investigation by enhancing the ionization feedback 
from the star on the surrounding matter, thus allowing a situation in 
which the envelope has been dispersed before the disk and the jet has 
escaped the pillar. 

Although the moderate spectral resolution of MUSE and its lim- 
ited coverage of blue wavelengths do not allow the determination of 
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Figure 1 | Three-colour composites of the star-forming region 

LMC N180 and the jet. a, b, Three-colour composites of LMC N180 

(red, [S 11] 6,731 A; green, Ha; blue, [O 111] 5,007 A). b, The red arrows 
point at the bow shocks and the yellow arrow indicates the jet source. 

c, Three-colour composite of the same region as b, where the red and the 
blue correspond to the red and blue Ha emission line peaks, and the green 
corresponds to the image of the collapsed MUSE data cube. d, e, Magnified 
images of the jet shown in b. The three-colour composite in d is the same 
as in a. In e, green corresponds to [S 11] 6,731 A. Inall panels, north is up 
and east is left. 
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Figure 2 | Spectrum of the red and blue jet lobes. a, Co-added spectra 
of the red and blue jet lobes (colour-coded accordingly), extracted from 
circular apertures centred on the green circles (labelled 1 for the red lobe 
and 5 for the blue lobe) in b. The estimated signal-to-noise ratios in the 
spectra were about 92 and 129 for the red and blue lobes, respectively. 


the precise spectral type of the star, together with the estimated mass 
of 12M.j and the absence of He 11 absorption lines, our observations 
indicate that the driving source of the jet is an early B-type star (rather 
than a late O-type star). The jet is associated with HH emission traced 
by bow-shock structures, where the jet lobes terminate (see Fig. 1). The 
bow shocks (which, together with the jet, make up the HH 1177 flow) 
are detected beyond the jet and span 11 pc (0.76’) from end to end, 
making this the second confirmed detection of an HH object beyond 
our own Galaxy", as well as one of the longest jets ever observed. 

The detection of emission line peaks together with a spatially 
resolved continuous structure traced by two distinct velocity com- 
ponents indicates a coherent jet with a spatial orientation in which 
the red lobe is moving away and the blue lobe is moving towards the 
observer. The red lobe extends about 3.5 pc (0.24’) to the southeast of 
the pillar, with a position angle of around 144°. The first portion of the 
red lobe, just below the source (at the position of aperture 4 in Fig. 2), is 
partly obscured by the dominating pillar emission. It consists of a main 
segment and a bright terminal knot in close vicinity to the southern 
bow shock. The blue lobe emerges from the top of the pillar with a 
position angle of about —32° and spans about 6.5 pc (0.45’), although 
its terminal knot is complicated by the projected vicinity of a star. 

The jet is highly collimated along its entire parsec-scale detected 
length. With an apparent half-width of approximately R= 0.1 pc (0.4”) 
throughout, the lower limits for the length-to-half-width ratios are 35 
and 65 for the red and blue lobes, respectively. We note that the jet 
width is an upper limit, as the true width is unresolved in our obser- 
vations, and the length-to-half-width is therefore a strict lower limit 
(see Methods). The measured radial velocities that correspond to 
the wavelength offsets of the red and blue peaks are of the order of 
300-400 km s~', yielding a (non-deprojected) dynamic timescale of the 
jet in the range (2.8-3.7) x 10* yr. Therefore, despite the short lifetimes 
of massive stars, the accretion and jet phases are at least of the order of 
tens of thousands of years. 

Figure 2 shows the continuum-subtracted spectra of the red and 
blue lobes; the red and blue peaks are clearly identified on the right 
and left of the central Ha line. What is seemingly a red peak in the 
blue spectrum corresponds to an OH skyline doublet! at 6,577.183 A 
and 6,577.3863 A. 

From the intensity of the red and blue emission line peaks and the 
estimated width of the jet body, we compute a mass loss rate of the jet of 
about 2.9x 10~°Mz yr! anda resulting mass accretion rate of around 
9.5 x 10°-°Mz yr_!. These rates are comparable to literature values for 
embedded MYSOs”, confirming the large mass of the driving source. 
Furthermore, these values also confirm that the source is young (given 
that the mass loss and accretion rates are expected to decrease with the 
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The strong, central Ha emission comes from the H 11 region, as is evident 
from the diffuse emission in Fig. 1b, where green is the central Ha 
wavelength; this wavelength does not include the red and blue emission 
line peaks and the jet is not visible. 


age of the star”°), supporting our contention that this is an optically 
observed jet from an unobscured MYSO. 

The total (projected) length of HH 1177 together with the detected 
bow shocks (about 11 pc) is comparable to some of the largest HH flows 
observed in the Milky Way*!”. However, traditional HH objects are 
emission-line nebulosities corresponding to shock-excited gas along 
a protostellar outflow, and direct detection of the jet body is possible 
with external ionization”? in massive star-forming regions. In the 
case of HH 1177, the jet itself is detected, allowing the direct meas- 
urement of its length-to-(half) width ratio, extent and velocity, as well 
as a description of its morphology and spatial orientation. The high 
degree of collimation and the measured radial velocities of HH 1177 
are similar to those found in lower-mass stars”>”°. This supports our 
theoretical knowledge of the connection between accretion disks and 
magneto-centrifugal ejection, as well as the physics of jet launching 
and collimation, in which the underlying processes are independent of 
stellar mass*2”2°. In terms of the accretion-ejection link, the detection 
of this large-scale MYSO and jet system indicates regular, active disk 
accretion epochs, which are a key factor in the formation of massive 
stars. These disk accretion epochs are required to circumvent the radi- 
ation pressure problem in the formation of massive stars!”. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Optical integral-field spectroscopy. The data used for this analysis are optical 
integral field data from the MUSE instrument!®, which is mounted on the VLT in 
Chile. MUSE has a field of view of about 1’x 1’ and a pixel scale of 0.2”. 

The data were acquired between November and December 2015, and were 

reduced with the MUSE data reduction pipeline” in the EsoRex environment 
using the available standard calibrations for each night. The entire mosaic of 
region LMC N180 consists of 64 single pointings, each observed twice with a 90° 
rotation dither pattern and an exposure time of 90s. The pointings were observed 
in the MUSE nominal wavelength range (4,650-9,300 A; resolving power of the 
spectrograph, 2,000-4,000) and a wide field of view. From the calibrated MUSE 
data cube, we estimate the noise of the spectra from the continuum flux closest to 
the analysed lines (from 6,610 A to 6,660 A). 
Integrated line maps and spectral fitting. The Ha, [S 11] 6,731 A and [O m1] 
5,007 A integrated line maps used to produce the three-colour image of Fig. 1 were 
obtained with the Python package Spectral Cube (http://spectral-cube.readthedocs. 
org) as for MUSE data of other pillar-like objects*°". The jet is not visible in the 
central Ha image; only by searching in the individual slices of the data cube in the 
region around the Ha line does the jet become evident. We therefore produced 
integrated line maps centred on the red and blue emission, as for the main emis- 
sion lines. 

To determine the amplitude and central wavelength of the blue- and red-shifted 
peaks of the Ha line, which trace the two lobes of the bipolar jet, we used the 
Python package Pyspeckit® to fit multiple Gaussian components to the emission 
lines in the region covering the [N 11] and the Ha lines. Because MUSE spectra 
span approximately 4,600 A in wavelength, they cover almost all of the nebular 
emission lines typically found in H 11 regions, as well as a multitude of skylines, 
which are particularly dense in the red parts of the spectra. Given the large number 
of lines, identifying and fitting the continuum across the entire MUSE wavelength 
range can be challenging and may result in an overestimation of the continuum 
and a negative baseline after subtraction. We therefore crop the spectra to the 
range 6,000-6,680 A to allow a more precise estimate of the local continuum. 
The fitted spectra are shown in Extended Data Figs 1 and 2. The fit of the blue 
lobe in apertures 5 and 7 does not appear optimal. Adding a fifth Gaussian 
component centred between the blue and the Ha lines seems to provide a better 
fit for the blue peak, but the resulting values of the fit are within the errors of 
the four-component fit; we therefore do not include the additional component. 
The best-fit parameters are summarized in Extended Data Tables 1 and 2. Error 
estimates are taken from the PySpecKit fitting routine and correspond to the root 
mean square of the residuals. For each spectrum shown, we also state the level 
of noise, which is estimated from the continuum closest to the analysed lines, as 
described above. 

Jet widths and collimation. We measure the (projected) diameter of the two lobes 
by extracting slices across the jet structure perpendicular to the jet direction at 
positions 1, 2, 3, 5, 6 and 8 in Fig. 2b (position 4 corresponds to the part of the red 
lobe covered by the emission from the pillar, and position 7 is contaminated by 
the emission from a nearby star). We estimate the diameter L of the jet body (and 
the half-width R = L/2) at each position by again fitting Gaussians to the obtained 
profiles (as shown in Extended Data Fig. 3). These widths are then compared with 


the jet length shown in Fig. 1 (see main text). However, owing to a seeing-limited 
spatial resolution of 0.6” (0.15 pc) at the distance of the LMC, the true width of 
the jet is unresolved and the values obtained from the described fit are strictly 
upper limits. As a consequence, the resulting length-to-half-width ratios are a 
strict lower limit. 

Mass loss and accretion rate. We compute the mass loss rate by first calculating the 
electron density n, from the measured intensity Iyo,req and Ip4,blue Of the emission 
line peaks and then combining it with the measured radial velocities V and width 
Ly. of the jet body in parsecs™: 


Ne= 15.0(Iuta/Lpe)/? (1) 


Mor = pmyne V0(L/2)°f (2) 


where ju is the mean molecular weight, for which we assume a value of 1.35, 
my, is the proton mass, V the jet velocity and fa geometric filling factor derived 
from the morphology of the images. We measure Ipja,req © (0.224 + 0.001) x 
10°! erg cm ~*s_! arcsec * from a circular aperture on the terminal red knot 
(centred on aperture 1) in the integrated intensity map, and use L ~ (0.08 + 0.04) pc 
to obtain n.& (25.13 +0.01) cm °. Because the width of the jet is not resolved in 
the MUSE data, we use 0.08 pc as the jet width, which is comparable to that of jet 
HH 80-81 (from figure 1 in ref. 21), and a large uncertainty of +50%. Together 
with a radial velocity of V~ (363.79 + 0.46) km s~! and f = 1, we obtain a mass 
loss rate of (1.57 + 0.99) x 10-°M yr~!. Similarly, for the blue lobe, we obtain 
Tro,blue © (0.139 + 0.001) x 10-19 erg cm~*s~! arcsec”? from aperture 5 and a mass 
loss rate of (1.28 +£0.81) x 10°-°Mz yr_|, with V~ (376.56 + 0.46) km s-!. Finally, 
we compute the combined mass loss rate to be about 2.85x 10 °Mz yr_'. Assuming 
a ratio of about 0.3 between the jet mass loss rate and the accretion rate**°, we 
obtain a mass accretion rate of about 9.5x 10° °Mz yr. 

Data availability. The dataset that supports the findings of this study is available 
in the ESO archive (http://archive.eso.org/eso/eso_archive_main.html) under the 
observing programme 096.C-0137(A) executed at the VLT (ESO Paranal, Chile). 
Additional requests can be directed to the corresponding author. 
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Extended Data Figure 1 | Fitted spectra of the red jet lobe. Co-added 
spectra of the red lobe of the jet (extracted from a 2-pixel-radius circular 
aperture centred on the green circles in Fig. 2b), continuum-subtracted 
and fitted with a three-component Gaussian. The spectrum is shown in 
black, the fit in red, the single components are plotted in blue and the 


6560 


LETTER 


Aperture 2 
5000 
N = 109.7 
4000 
3000 
2000 
x 
= 1000 
nw 
sg se aa Bl fet = ae Pe a a | 
6560 6570 6580 6590 6600 
Wavelength (Angstrom) 
Aperture 4 


Flux 


t T T 
6570 6580 6590 6600 


Wavelength (Angstrom) 


T 
6560 


residuals of the fit are shown below each spectrum (the latter two are 
shown with an offset on the y axis for better display). The estimated noise 
Nis stated for each spectrum. The best-fit parameters are summarized in 
Extended Data Table 1. The flux is in units of 10-7° erg s-! A~! cm. 
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Extended Data Figure 2 | Fitted spectra of the blue jet lobe. Co-added 
spectra of the blue lobe of the jet (extracted from a 2-pixel-radius circular 
aperture centred on the green circles in Fig. 2b), continuum-subtracted 
and fitted with a four-component Gaussian. The spectrum is shown in 
black, the fit in red, the single components are plotted in blue and the 


Aperture 6 
N=114.1 


—-3000 T T T T T T T T 
6560 6565 6570 6575 6580 6585 6590 6595 6600 
Wavelength (Angstrom) 
Aperture 8 
5000 


N = 166.3 


Flux 


6560 6565 6570 6575 6580 6585 6590 6595 6600 
Wavelength (Angstrom) 


residuals of the fit are shown below each spectrum (the latter two are 
shown with an offset on the y axis for better display). The estimated noise 
is stated for each spectrum. The best-fit parameters are summarized in 
Extended Data Table 2. The flux is in units of 10-7° erg s-! A~! cm. 
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Extended Data Figure 3 | Diameter of the jet body. Black curves are the 
integrated line map intensity profiles along virtual slits perpendicular to 
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lobes (positions 5, 6 and 8) marked in Fig. 2b. Red curves are the best-fit 
Gaussian profiles; blue diamonds are residuals. 


LETTER 


Extended Data Table 1 | Best-fit parameters of the Gaussian fitting to the red lobe 


Ha Red peak [NII] 
Ap. # Peak Flux Centroid Width Peak Flux Centroid Width Peak Flux Centroid Width 

1 82399.0 6568.0 + 7.5x10° 1.24 3057.0 + 6575.9 + 0.3 21+ 4649.54 6588.8 + 0.1 1.140.1 
+ 416.8 7.7*10° 331.0 0.3 430.6 

2 99262.3+ 6568.1 + 7.5x10° 1.154 2184.6 + 6576.5 + 0.4 1.74 5604.3 + 6588.8 + 0.1 1.2+0.1 
570.9 7.7*10° 466.8 0.4 554.9 

3 80343.2 + 6568.0 + 7.2x10° 1.24 2027.6 + 6576.3 + 0.3 1.44 4281.44 6588.8 + 0.1 1.140.1 
429.0 7.4x10° 390.9 0.3 431.3 

4 140227.0 + 6568.0 + 6.5x10° Li¢ 1132.74 6576.2 + 1.2 24+ 7338.1 + 6588.7 + 0.1 1.2+0.1 
735.8 6.8x10° 485.0 1.2 691.4 


The spectra and fits are shown in Extended Data Fig. 1. Fluxes are in units of 10 


20 ergs-1A-1 cm~2, centroids and widths are in A. 
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Extended Data Table 2 | Best-fit parameters of the Gaussian fitting to the blue lobe 


Blue Ha Sky 
Ap. # Peak Flux Centroid Width Peak Flux Centroid Width Peak Flux Centroid Width 

5 1606.5 + 6560.7 + 0.3 2.3 +0.4 159788.0 + 6568.1 + 1.1 £2.5x10" 1484.44 6577.4 +0.3 1520.3 
203.2 295.7 2.4x10° 251.8 

6 1834.4 + 6559.4 + 0.2 1.640.2 126890.0 + 6568.1 + 1.1 +2.6x10° 1677.14 6577.4 +0.2 1.040.2 
210.6 258.1 2.6x10° 257.4 

7 808.1 + 6560.2 + 0.7 3.140.9 85319.3 + 6568.0 + 1.1+4.0x10° 1317.6 + 6577.3 + 0.2 1.2+0.2 
153.9 249.3 3.7x10° 229.2 

8 2792.7 + 6560.8 + 0.2 1540.2 79684.5 + 6567.9 + 1.2+5.1x10° 1378.0 + 6577.5 +0.3 0.94+0.2 
261.5 300.5 5.0x10° 310.8 


The spectra and fits are shown in Extended Data Fig. 2. Fluxes are in units of 10-2° erg s-!A~! cm~2, centroids and widths are in A (the parameters of the [N II] line are not reported in this table 
because they are not directly relevant). 
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Pulsating aurora from electron scattering by 


chorus waves 


S. Kasahara!, Y. Miyoshi’, S. Yokota’, T. Mitani*, Y. Kasahara°, S. Matsuda’, A. Kumamoto®, A. Matsuoka‘, Y. Kazama’, 
H. U. Frey®, V. Angelopoulos’, S. Kurita’, K. Keika!, K. Seki! & I. Shinohara‘ 


Auroral substorms, dynamic phenomena that occur in the upper 
atmosphere at night, are caused by global reconfiguration of the 
magnetosphere, which releases stored solar wind energy”. These 
storms are characterized by auroral brightening from dusk to 
midnight, followed by violent motions of distinct auroral arcs 
that suddenly break up, and the subsequent emergence of diffuse, 
pulsating auroral patches at dawn’. Pulsating aurorae, which 
are quasiperiodic, blinking patches of light tens to hundreds of 
kilometres across, appear at altitudes of about 100 kilometres 
in the high-latitude regions of both hemispheres, and multiple 
patches often cover the entire sky. This auroral pulsation, 
with periods of several to tens of seconds, is generated by the 
intermittent precipitation of energetic electrons (several to tens of 
kiloelectronvolts) arriving from the magnetosphere and colliding 
with the atoms and molecules of the upper atmosphere*’. 
A possible cause of this precipitation is the interaction between 
magnetospheric electrons and electromagnetic waves called 
whistler-mode chorus waves*!!. However, no direct observational 
evidence of this interaction has been obtained so far!”. Here we 
report that energetic electrons are scattered by chorus waves, 
resulting in their precipitation. Our observations were made in 
March 2017 with a magnetospheric spacecraft equipped with a 
high-angular-resolution electron sensor and electromagnetic 
field instruments. The measured!*»!4 quasiperiodic precipitating 
electron flux was sufficiently intense to generate a pulsating 
aurora, which was indeed simultaneously observed by a ground 
auroral imager. 

Theories and computer simulations have provided a promising 
explanation for the origin of electron precipitation: the interaction 
between electrons and whistler-mode chorus waves near the magneto- 
spheric equator”*!>-!©, After chorus waves are generated at the equator, 
they propagate towards higher latitudes (grey wavy arrows in Fig. 1) 
and interact with bouncing, counter-streaming electrons in both 
hemispheres. As these electrons are scattered by the Lorentz force of 
the chorus waves, some of them become nearly aligned with the field, 
fall into the empty electron loss cone (blue open arrow in Fig. 1a), fill it 
(blue filled arrows in Fig. 1a), and precipitate into the upper atmosphere 
(top left in Fig. 1a). 

Data from the THEMIS (Time History of Events and Macroscale 
Interactions during Substorms) spacecraft and ground-based imagers 
have shown a correlation between chorus-wave modulation and auroral 
patch pulsation®!°. Geosynchronous satellite measurements have 
revealed a correlation between electron flux modulation and auroral 
pulsation’’. However, direct evidence of interaction between chorus 
waves and precipitating electrons at the source region—the connection 
needed to verify the whistler-mode chorus-electron interaction theory 


of pulsating aurorae—has not been obtained. This is principally because 
of the large acceptance angle of earlier space plasma instruments, which 
prohibited them from distinguishing small loss cones in the equatorial 
magnetosphere. Thus, it has been difficult to directly establish the asso- 
ciation between precipitating electrons hidden inside loss cones and the 
chorus waves. The high angular resolution of our instrumentation has 
enabled us to overcome this limitation. 

On 2017 March 27, auroral pulsations were observed by the ground- 
based auroral All-Sky Imagers (ASIs) of the THEMIS mission!® (Fig. 2) 
when a magnetic storm with auroral substorm characteristics 
developed. Auroral snapshots illustrate the appearance and disap- 
pearance of pulsating auroral patches. The ionospheric footprint of 
the Exploration of energization and Radiation in Geospace (ERG, also 
called Arase) spacecraft? 14 which was launched in December 2016 
and started observations in March 2017, was traced with an empirical 
magnetic field model!? and plotted over the auroral images (red crosses 
in Fig. 2). The model footprint is well within the fields of view of the 
ASIs in which the auroral pulsations can be clearly detected (see also 
Supplementary Video 1). Depending on the instantaneous magneto- 
spheric configuration, however, the actual footprint may be displaced 
from the model by a few degrees in magnetic latitude and longitude” 
(fewer than several hundred kilometres in both directions). Therefore, 
the model footprint should be regarded as approximately located in a 
wide area where pulsating aurorae were observed (we note that the field 
of view of an ASI is about 1,000 km across, larger than the mapping 
uncertainties). Electrons with sufficient energy flux to cause a visible 
aurora were observed near the equator, closely correlated with chorus 
waves when the footprint of ERG was located in a region replete with 
pulsating auroral patches. 

During morning hours in magnetic local time, ERG captured a cor- 
relation between wave activity and particle flux modulations near the 
magnetospheric equator. During the 30-min duration of the observa- 
tion near the equator, waves are seen intermittently in the power spectra 
of the magnetic (Fig. 3a) and electric fields (Extended Data Fig. 1a) 
in both the lower-band (<0.5f.-) and upper-band (>0.5f,.) frequency 
ranges (where f.. is the electron gyrofrequency). Despite dynamic 
chorus wave activity, northward-streaming electron fluxes outside 
the loss cone (with pitch angle, the angle between the particle velocity 
and the local magnetic field, from 20° to 40°) are relatively stable 
and do not show any apparent correlation with the waves (Fig. 3c). 
This is consistent with the absence of correlation between electron flux 
and chorus waves in previous observations. 

Dramatic modulations of the electron flux in concert with chorus 
waves do occur in the loss cone, however. The northward loss-cone 
electron flux (Fig. 3b), as measured by the Medium-Energy Particle 
experiments - electron analyser (MEP-e) onboard ERG with a 
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Figure 1 | Schematic of electron scattering by chorus waves, resulting 
in a pulsating aurora. a, The open blue arrow represents northward- 
streaming electrons with an empty loss cone (b) before chorus waves 
interact with them (c). The blue filled arrows represent the same electrons 
but with a filled loss cone (d) after their interaction with chorus waves 


3.5° field of view (full-width at half-maximum), shows repeated enhance- 
ments and depletions over a broad energy range from 10 keV to 30keV, 
in striking correspondence with bursts of lower-band chorus 
waves (Fig. 3a). 

The correlation between chorus waves and loss-cone electron flux is 
more clearly presented in Fig. 4, which focuses on the flux of 24.5-keV 
electrons (selected because of the good loss-cone coverage) and a mag- 
netic wave power spectral density of 0.38-0.64kHz. This combination 
of energy and frequency satisfies the cyclotron resonance condition”! 
(thus enabling considerable electron scattering) if we assume an elec- 
tron density of 3cm~?, which is typical of this region”, and it is also 
consistent with the observed frequency of the faint signature of an upper 
hybrid resonance wave (Extended Data Fig. 1a). The excellent correla- 
tion between loss-cone electrons (black solid lines in Fig. 4) and chorus 
waves (blue dash-dotted lines in Fig. 4) indicates that wave—particle 
interaction is indeed taking place. Upon close examination of Fig. 4, 
we see that the loss-cone flux is strongly modulated on a timescale 


The Pas 
2017 March 27 10:48:39 


Figure 2 | Auroral snapshots around the footprint of ERG. Successive 
clear-sky images (10:48:39 to10:48:57 uT) from a ground station 

(the Pas). Distinct pulsating patches in the field of view of the Pas are 
indicated by yellow arrows. The filled and open arrows correspond to 
pulsations that are ‘on’ (bright aurora) or ‘off’ (no aurora), respectively. 
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2017 March 27 10:48:48 


(pitch-angle scattering by waves). The red arrows represent loss-cone- 
filling electrons streaming southwards. The spacecraft location is denoted 
by a black filled circle. b-d, The evolution of the electron PAD. Because 
chorus activity is intermittent, loss cone filling and depletion are repeated, 
causing auroral pulsations. 


of 10s (or less, although the time resolution of the loss-cone meas- 
urement is 8s, and thus shorter periods cannot be resolved), within 
the typical range of auroral pulsations. As expected from low-altitude 
observations*!!, the flux is modulated bya factor of 2 to 3. Furthermore, 
when the loss cone is filled, the measured electron energy flux in the 
atmosphere is sufficient to excite visible aurorae. Specifically, it is 
several times 10°keV cm~*s~ or several erg cm~*s~'—comparable 
to, but above, the threshold for visible aurorae”* (Extended Data Fig. 2 
further illustrates the correlation between auroral patch pulsation and 
the loss-cone electron flux, as well as the chorus waves). 

Figure 4 also illustrates that the variability in the quasiparallel flux 
outside the loss cone (shown by black dashed lines) is nearly stable 
compared to the dynamic modulation of the loss-cone flux and chorus 
wave power. This indicates that the observed electron flux just outside 
the loss cone does not provide clear evidence of precipitation, which 
explains why previous measurements did not reveal the wave-particle 
correlation that causes pulsating aurorae. 


2017 March 27 10:48:57 


The red crosses show the nominal spacecraft footprint. The dotted lines 
illustrate magnetic coordinates every 5° in latitude and 15° in longitude, 
respectively. Supplementary Video 1 shows the full motion of the pulsating 
auroral patches in a 3-s cadence. MLAT, magnetic latitude. 
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Figure 3 | In situ observations by ERG. a, Frequency-time spectrogram of 
the magnetic field power spectral density (in square picoteslas per hertz), 
showing chorus waves. The magenta and white lines indicate 0.5f-. and 
1.0f-., respectively, relative to the local magnetic field. b, c, Energy-time 
spectrograms for differential fluxes of electrons (expressed in electrons 

per (cm? keV sr s)) in the loss cone (pitch angles PA < 2°) (b) and outside 


The interpretation of this observation is illustrated in Fig. 1 
(bottom left box). The spacecraft was in the Southern Hemisphere 
but quite close to the magnetospheric equator. The loss cone of the 
northward-streaming electrons, which is empty during quiet periods, 
fills with scattered electrons as southward-propagating chorus waves 
emerge. The evolution of the pitch-angle distribution (PAD) of the 
northward-streaming electron flux is schematically shown in the right 
panels of Fig. 1b-d. On the other hand, the electron flux in the 
southward-streaming loss cone (red arrows) is also seen in the observa- 
tion (Extended Data Fig. 1d). Although this flux is also modulated (that is, 
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Figure 4 | Correlations between electron flux and wave power spectral 
density. The black solid line shows the modulation of the loss-cone 
electron flux (PA < 2°); the black dashed line indicates the nearly stable 
electron flux outside the loss cone (PA = 20°-40°). The blue dashed-dotted 
line shows the wave magnetic field power spectral density. The electron 


the loss cone, but quasiparallel to the magnetic field (PA = 20°-40°) (c). 
In b, white areas indicate absence of loss-cone measurements owing to 
the instrument’s limited field of view. MLT, magnetic local time of the 
spacecraft; L, distance (in Earth radii) where a magnetic field line passing 
through the spacecraft crosses the magnetospheric equator. 


varies with time), the correlation with the chorus waves is appar- 
ently weaker than that of the northward loss-cone flux (Fig. 3b and 
Extended Data Fig. 1c). For example, chorus waves were absent from 
10:56 to 11:00 universal time (UT), while the southward-streaming loss 
cone continued to be filled. This can be expected because southward- 
streaming electrons must interact with northward-propagating chorus 
waves (presumably present in the Northern Hemisphere), which are 
not necessarily symmetrical with southward-propagating chorus waves. 

We determined the location and physical mechanism of electron 
scattering into the loss cone and subsequent precipitation of energetic 
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energy is 24.5 keV. The frequencies of the power spectral density are 
0.64kHz for a-c and 0.38 kHz for d, selected according to the resonance 
condition. The correlation coefficients (CC) between the loss-cone 
differential flux and the logarithmic power spectral density are shown. 
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electrons, which results in a pulsating aurora. Whether these results 
can be generalized across radial distances from Earth, different local 
times and geomagnetic activity conditions is yet to be determined. 
Comparing loss-cone fluxes with other potential scattering mecha- 
nisms, such as electrostatic electron cyclotron harmonic waves’ and 
large-amplitude parallel electric fields'*, for which the correlated 
energy range is typically lower than that discussed here, would help 
to determine whether other wave types make a similar contribu- 
tion to scattering. The physical process presented here should apply 
to the aurorae of Jupiter and Saturn, where chorus waves have been 
detected™*’*, and may also operate at other magnetized astrophysical 
objects. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Electron analyser. The key observation in our dataset is the electron measurement 
made by the MEP-e sensor onboard the ERG spacecraft. The sensor comprises 
an electrostatic analyser (ESA) specifically designed for energetic particles”° and 
avalanche photodiodes”’ (APDs). The ESA is an energy filter covering 7-87 keV 
with an energy resolution of 8%. The angular resolution is determined by 
combining the focusing property of the ESA with the size of the detector (or 
the APD). The ESA has a disk-like, 21-rad field of view divided into 16 narrow 
elements, each of which corresponds to an APD. Using the spacecraft’s spin motion, 
a full solid angle is swept (with certain dead angles due to gaps between detectors). 
Depending on the geometry (more specifically, the angle between the spacecraft’s 
spin axis and the magnetic field), MEP-e may see the magnetic field direction (and 
thus observe the loss cone) once or twice in one spacecraft spin, as in the presented 
case. The sensor may also miss the magnetic field directions (the magnetic field 
direction drops at the gaps) or the loss cone may be observed for a longer time 
(if the spin axis is along the magnetic field direction). Neither of these scenarios 
occurred in the time period reported here. 

Angular resolution. The most important property of the MEP-e sensor for 
this study—the angular resolution in two orthogonal directions—is shown in 
Extended Data Fig. 3. These calibration data were obtained by pre-flight laboratory 
experiments. Profiles are essentially the same for all 16 channels. Here the data are 
modelled with a Gaussian shape and a full-width at half-maximum of 3.5° for use 
in the evaluation of the loss-cone determination discussed below. 

Loss cone determination. In Figs 3b and 4, as well as in Extended Data 
Fig. 1c and d, we have plotted the loss-cone electron flux. In these analyses, we used 
the following criterion for loss-cone measurements: when the angle between the 
centre of the field of view of the MEP-e and the magnetic field is smaller than 2°, 
the detected electrons are inside the loss cone. This approach needs caution 


LETTER 


regarding contamination from electrons outside the loss cone, because a fraction of 
the field of view can extend outside the loss cone. We carefully evaluated this effect 
(Extended Data Fig. 4) and found it to be insignificant (although not negligible), 
confirming the robustness of our analyses. 

Estimation of precipitating electron energy flux. The downward electron 
energy flux F at the ionosphere, which contributes to the auroral illumination, 
is estimated as F © (Bi/Beq) E Jeq AS2 AE, where the parameters are the magnetic 
field strength at the ionosphere (B,~ 55,000 nT) and at the equator (Beq), the 
electrons’ characteristic energy (E), the differential flux at the equator (Jeq), the 
solid angle of the loss cone (A) and the energy range of the precipitating electrons 
(AE). On the basis of our in situ observation, we take Beg 100 nT, E~ 20keV, 
Jeq® 4X 10°keV cm~*sr-!s-'keV! (here we take the peak value), AQ 
6x 10-3 sr and AE~ 20keV to obtain a downward electron energy flux of about 
5x10°keV cm~?s"!, or 8ergcm?s!. 

Data availability. The ERG data presented here are publicly available from the 
ERG science centre website (https://ergsc.isee.nagoya-u.ac.jp/dataset/2017.1-1001. 
shtml.en). THEMIS-ASI data that support the findings of this study are publicly 
available at http://themis.ssl.berkeley.edu/data_all.shtml. 

Code availability. The software used to read and analyse the data, part of 
the publicly available SPEDAS software package, is available at http://themis.ssl. 
berkeley.edu/software.shtml and can be used without any restrictions. All data have 
a common data file format (https://cdf.gsfc.nasa.gov/). 
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Extended Data Figure 1 | In situ observations by ERG with an additional (PA =20°-40°) and quasiantiparallel (PA = 140°-160°) electrons 
dataset. a, b, Frequency-time spectrograms of the power spectral densities _ show essentially the same trend as that of the perpendicular 


of the electric (a) and magnetic field (b), showing chorus waves. The flux. f, Flux(PA < 2°)/flux(PA = 20°-40°); g, flux(PA > 178°)/ 
magenta and white lines indicate 0.5f.. and 1.0f.., respectively, based on flux(PA = 140°-160°). The graphs in b and c are the same as those in 
local magnetic field observations. c-e, Energy-time spectrograms for Fig. 3a and b, respectively (but replotted here for comparison with a 
differential fluxes of loss-cone electrons parallel (pitch angles PA < 2°) and d). The faint signature of an upper hybrid resonance wave at 

(c) and anti-parallel (PA > 178°) (d) to the magnetic field and electrons 12-16 kHz in a at about 11:10 uT is consistent with the assumed 
perpendicular to the magnetic field (PA = 80°-100°) (e). Quasiparallel density of approximately 3 cm~°. 
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Extended Data Figure 2 | Correlation coefficients for auroral intensity. 
a, The colours (red, yellow, cyan and magenta) show the correlation 
coefficients between the auroral intensity and the loss-cone electron flux. 
b, Time series data of the loss-cone electron flux and the auroral intensity 
at a pixel at which electrons and chorus waves have nearly the highest 
correlation. The auroral intensity is plotted at the same time resolution as 
the electrons (about 8 s). c, d, The same as a and b, but for the chorus wave 
intensity. The wave intensity in d is plotted at the same time resolution 

as the auroral intensity (3 s). In a and c, the background auroral images 
are magnified around the centre of the field of view of the Pas station. 
Highest-correlation pixels are consistently located near the centre of both 
panels, suggesting the spacecraft footprint. The dashed lines in a and c 
illustrate magnetic coordinates every 2° in latitude and 5° in longitude. 
The displacement of the model footprint from the high-correlation pixels 
is approximately —0.5° and —5° in latitude and longitude, respectively, 
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consistent with typical modelling errors”’. Cross-correlations were 
calculated for the period 10:54:00-10:58:00 ut. In other time periods, 
high-correlation pixels were not commonly obtained, perhaps because 
of the fine structures of pulsating patches and the equatorial modulation 
regions near the spacecraft. For example, if the spacecraft leaves the 
localized modulation region as a result of magnetospheric configuration 
change, chorus waves and associated electron precipitation disappear at 
its location, but pulsating patches can continue to be ‘or’ if the equatorial 
modulation region still exists. In other words, although the spacecraft’s 
footprint in the ionosphere can leave an illuminated patch owing to spatial 
reconfiguration of magnetic field line structures or plasma phenomena, 
the auroral intensity remains high at some pixels. Other reasons that 
make the above correlations difficult to identify, such as the contribution 
of soft electron (<10 keV) precipitation to higher-altitude (>100 km) 
illumination, may be studied in future work. 
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Extended Data Figure 3 | The angular response of MEP-e in two Blue circles, laboratory data; black line, Gaussian model. The model curves 


were used to obtain the analysis results shown in Extended Data Fig. 4. 


orthogonal directions. a, Sensor response as a function of elevation angle 
Profiles for one detector are shown here; similar profiles were obtained for 


with respect to the sensor’s mounting plane. b, Response in the sensor’s 
azimuthal direction, which is orthogonal to the elevation angle. the other 15 detectors. 
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Extended Data Figure 4 | Results of PAD model taking the sensor’s 
angular resolution into account. Because of the finite angular resolution, 
contamination from outside the loss cone cannot be completely negligible. 
The blue line indicates the model input PAD, which is isotropic except for 
the step-function drop at the loss-cone angle of 2.5° (the nominal loss- 
cone angle in the event presented in this paper, based on a local magnetic 
field of about 100 nT). The red curve shows how the electron PAD is 
modulated by the effect of the sensor’s finite angular resolution. The grey 
dashed line indicates the threshold, 2°, for loss-cone selection (that is, 

if the angle between the centre of the field of view and the magnetic field 
is smaller than the threshold, the measured flux is considered to be the 


flux inside the loss cone). For this calculation, the sensor’s field of view is 
modelled by a Gaussian cone (full-width at half-maximum, 3.5°), based 
on the ground calibration. For example, even when the middle of the 
sensor’s field-of-view is centred along the magnetic field line and the 
actual electron PAD has an ideally empty loss cone, the instrument can 
inadvertently record about a few tens per cent of the flux from outside the 
loss cone. In our observations, however, the electron flux in the loss cone 
most often exhibits a filling ratio larger than 0.5, sometimes about 1, when 
the precipitation is ‘on (Fig. 4 and Extended Data Fig. 1f and g), too large 
to be explained by contamination alone. Also, synchronization with chorus 
waves cannot be produced by this instrumental effect. 
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The honeycomb lattice is one of the simplest lattice structures. 
Electrons and spins on this simple lattice, however, often form 
exotic phases with non-trivial excitations. Massless Dirac 
fermions can emerge out of itinerant electrons, as demonstrated 
experimentally in graphene’, and a topological quantum spin liquid 
with exotic quasiparticles can be realized in spin-1/2 magnets, as 
proposed theoretically in the Kitaev model*. The quantum spin 
liquid is a long-sought exotic state of matter, in which interacting 
spins remain quantum-disordered without spontaneous symmetry 
breaking*. The Kitaev model describes one example of a quantum 
spin liquid, and can be solved exactly by introducing two types of 
Majorana fermion”. Realizing a Kitaev model in the laboratory, 
however, remains a challenge in materials science. Mott insulators 
with a honeycomb lattice of spin-orbital-entangled pseudospin-1/2 
moments have been proposed’, including the 5d-electron systems 
a-Na2IrO;3 (ref. 5) and a-Li,IrO; (ref. 6) and the 4d-electron 
system c-RuCl; (ref. 7). However, these candidates were found to 
magnetically order rather than form a liquid at sufficiently low 
temperatures*!°, owing to non-Kitaev interactions®!!-3, Here 
we report a quantum-liquid state of pseudospin-1/2 moments in 
the 5d-electron honeycomb compound H3;Lilr.O.¢. This iridate 
does not display magnetic ordering down to 0.05 kelvin, despite 
an interaction energy of about 100 kelvin. We observe signatures 
of low-energy fermionic excitations that originate from a small 
number of spin defects in the nuclear-magnetic-resonance 
relaxation and the specific heat. We therefore conclude that 
H3Lilr,O,¢ is a quantum spin liquid. This result opens the door 
to finding exotic quasiparticles in a strongly spin-orbit-coupled 
5d-electron transition-metal oxide. 

According to the third law of thermodynamics, magnets with many 
spin degrees of freedom must have zero entropy in the ground state. 
The conventional way of releasing the entropy is to form a magnetically 
ordered state. However, there is another way of forming a macroscopi- 
cally non-degenerate state: through quantum effects, which hinder 
long-range order. This exotic state of matter is called a quantum spin 
liquid’. Since the first conjecture of the resonant-valence-bond state!4, 
geometrically frustrated S = 1/2 antiferromagnets (where S is the 
spin) on a triangular-based lattice have been studied experimentally 
and theoretically to explore the quantum-spin-liquid state. Although 
the resonant-valence-bond state is believed to be a quantum super- 
position of spin singlets, it has not been obtained as an exact solution 
of any model Hamiltonian. Despite the theoretical challenges, it has 
been argued on the basis of experiments that certain antiferromag- 
netic, S= 1/2 triangular and kagome materials are quantum spin 
liquids, including the organic materials k-(BEDT-TTF)2Cu2(CN)3 
(ref. 15) and EtMe3Sb[Pd(dmit).], (refs 16, 17) and the inorganic 
material ZnCu3(OH) Cl, (refs 18-20). 


Recently, the simple honeycomb lattice has become important 
in the exploration of quantum spin liquids, owing to a theoretical 
breakthrough”. Kitaev” proposed a model in which $= 1/2 spins on 
a honeycomb lattice are coupled to their three nearest neighbours 
by ferromagnetic Ising interactions, with bond-dependent easy axes 
parallel to the x, y and z axes (Fig. 1a). The orthogonal anisotropy of the 
three nearest-neighbour bonds creates a conflict between these bonds, 
giving rise to strong magnetic frustration. By introducing Majorana 
operators, Kitaev showed rigorously that the ground state of the asso- 
ciated Hamiltonian is a quantum spin liquid. Topological excitations 
are anticipated, originating from the itinerant Majorana fermions with 
Dirac dispersion and from the localized Majorana fermions that give 
rise to Z, gauge flux. The Kitaev model provides an alternative pathway 
to a quantum spin liquid compared to the resonant-valence-bond case, 
with the advantage that it has an exactly solvable ground state. 

The Kitaev model was thought to be a toy model for theorists 
because pure S= 1/2 spins in general do not accommodate strong 
Ising anisotropy. However, 5d transition-metal oxides, in particular 
Ir** oxides with five d electrons, were recently shown to be promising 
candidates for Kitaev spin liquids. In complex Ir** oxides with 
octahedral coordination of oxygen ions, a spin-orbital Mott state with 
pseudospin Jeg= 1/2 moments is often formed”!. If neighbouring IrO¢ 
octahedra share an edge, then super-exchange interactions give rise to 
a ferromagnetic Ising interaction of J.f= 1/2 moments with easy axes 
perpendicular to the Ir-O,-Ir plane—the essential ingredient of the 
Kitaev model’. 

The spin-orbital Mott insulators «-NazIrO3 and «-Li,IrO; crystallize 
ina layered structure, in which IrOg octahedra form a honeycomb net- 
work by sharing the three orthogonal edges of an octahedron (Fig. 1b). 
These honeycomb iridates, and more recently the closely related com- 
pound a-RuCl;, have been studied extensively as possible materiali- 
zations of the Kitaev model*-’. However, their ground states were 
found to be magnetically ordered*""° rather than forming quantum 
spin liquids, although signatures of the existence of Kitaev interactions 
were captured”, It was argued that additional interactions, such as 
Heisenberg interactions generated by d-d exchange coupling, compete 
with Kitaev-type interactions and stabilize the magnetically ordered 
state", Indeed, the Curie-Weiss temperature Ocw estimated from the 
magnetic susceptibilities of a-NazIrO3 and a-Li,IrO; is not positive, as 
would be expected for Kitaev-type ferromagnetic interactions, which 
points to the presence of additional antiferromagnetic interactions. 

The difficulty in realizing a Kitaev spin liquid in the aforementioned 
candidates motivated us to explore other honeycomb-based Ir** oxides. 
During this exploration, we discovered a quantum-spin-liquid state 
in the honeycomb Ir oxide H3LilrO¢, in which all of the interlayer 
Li* ions of «-Li,IrO; are replaced with H* ions in an ion-exchange 
reaction” (Fig. 1d), but the Lilr,O. honeycomb plane remains as it 
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Figure 1 | Crystal structure and basic physical properties of H3Lilr2O.. 
a, Kitaev model on a honeycomb lattice. § = 1/2 spin moments (indicated 
by arrows) are present on the honeycomb lattice, coupled by bond- 
dependent ferromagnetic Ising interactions. The three 120° bonds with 
orthogonal Ising axes compete with each other, giving rise to strong 
magnetic frustration and hence a quantum-spin-liquid state. b, Lilr,0¢ 
layer unit with an edge-shared network of IrO¢ octahedra for a-Li,IrO3 
and H3Lilr2Og. Ir** ions with Je¢= 1/2 moments form a honeycomb 
sublattice, as indicated by the dotted lines. The edge-shared Ir-O-Ir bond 
gives rise to ferromagnetic Ising interaction between the neighbouring 
Jetp= 1/2 moments. ¢, Layer stacking of a-Li,IrO; (ref. 30). d, Layer 
stacking of H3Lilr,O, (ref. 30), in which the interlayer Li* ions are 
replaced with H™ ions. e, Magnetic susceptibility y(T) for HsLilr.O¢ 


was (Fig. 1b)*°. Powder X-ray diffraction indicates the presence of 
stacking faults between the honeycomb planes (Methods, Extended 
Data Fig. 1). The presence of magnetic defects with a density of 1%, 
which probably originate from impurities or vacancies, is indicated 
by the temperature- and magnetic-field-dependent magnetization 
M(T, B) and the low-temperature specific heat C(T, B) (Extended 
Data Fig. 2). 

The resistivity p(T) of H3Lilr.O¢ (Extended Data Fig. 3) exhibits 
insulating behaviour with an activation energy of approximately 
0.12 eV. In addition, Curie-Weiss behaviour is observed in the mag- 
netic susceptibility .(T) with an effective moment of approximately 
1.60jug per Ir atom (where jug is the Bohr magneton) at temperatures 
above 200K (Fig. le). These findings indicate that H3Lilr.O¢ is a 
spin-orbital J.g= 1/2 Mott insulator. We find a negative value for Ocw 
of —105 K, which implies overall antiferromagnetic interactions, as in 
other honeycomb Ir oxides*®. The energy scale of magnetic interactions 
is of the order of 100K, but no trace of magnetic ordering is observed 
in y(T) down to 2K, in sharp contrast to other Kitaev candidates>-7. 
Accordingly, C(T) down to 0.05 K (Fig. 1f) does not show any signature 
of a phase transition. These results suggest that the ground state of 
H;Lilr2O¢ is a liquid state of J.¢¢= 1/2 moments. 

x(T) and C(T) are not very sensitive to a weak or glassy magnetic 
ordering. We therefore conducted ’Li and 'H NMR measurements 
on aligned powders (Fig. 2a, b) to exclude the possibility of any 
magnetic ordering. The absence of apparent peak splitting or broad- 
ening on cooling (see also Extended Data Fig. 4a) clearly indicates 
that the system remains paramagnetic down to 1K, roughly 1% of the 
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measured at 1 T. The raw data (solid line and circles) is shown along with 
the presumed intrinsic y (dotted line) after numerically subtracting a 
low-temperature Curie-like contribution, which probably originates from 
magnetic defects (see also Extended Data Fig. 2a). The inset shows 1/y as 
a function of temperature to emphasize the high-temperature Curie-Weiss 
behaviour of localized Jerg= 1/2 moments. The extrapolation to T=0 
provides an estimate of the antiferromagnetic Curie-Weiss temperature of 
Ocw=—105K. f, Specific heat C as a function of T (main panel) and C/T as 
a function T* (inset) down to 0.05 K, indicating no signature of magnetic 
ordering. The large non-lattice contribution suggests the presence of low- 
lying spin excitations. The nuclear Schottky contribution is subtracted 

(see Methods). 


energy scale of magnetic interaction. We therefore conclude that a 
quantum-spin-liquid state is realized in honeycomb H3Lilr.O.¢. 

The Knight shift K(T) represents an intrinsic magnetic susceptibility, 
free from magnetic defects. In Fig. 3a, K(T) is shown for ’Li with mag- 
netic fields parallel and perpendicular to the honeycomb plane and no 
indication of magnetic ordering is observed. The sizable anisotropy 
in the susceptibility of up to around 2 between the two field orienta- 
tions should originate from spin-orbit coupling, but is not expected 
for the Kitaev model with equal Ising couplings on the three bonds. Its 
existence probably implies an anisotropy in the magnitude of the three 
Ising couplings and/or the presence of off-diagonal interactions'””®. 
On cooling below about 200K, K(T) deviates from the Curie-Weiss 
behaviour that is seen at higher temperatures, exhibiting a broad peak 
at around 130 K followed by a gradual decrease to a non-zero value. The 
non-zero susceptibility in the low-temperature limit is analogous to the 
susceptibilities observed in organic spin liquids!*!°, which have been 
interpreted as evidence for gapless spin excitations. However, spin-orbit 
coupling can lead to a non-zero susceptibility even in a spin liquid with 
a finite excitation gap; because the spin-orbit coupling is strong for Ir 
(Aso ¥ 0.5 eV), the non-zero susceptibility does not necessarily imply 
gapless excitations. 

The presence of low-lying spin excitations is captured by the NVR 
spin relaxation rate Ti. In Fig. 3b we plot (T,T)~ | which measures the 
density of spin excitations. At a low magnetic field of B=1T, (T\T)~! 
for 'H and ’Li remains non-zero and shows only weak temperature 
dependence below about 40K. With increasing B, however, (T\T)! at 
low temperatures is suppressed and decreases rapidly below a 
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Figure 2 | Evidence from NMR spectroscopy of a spin-liquid ground 
state in H3Lilr,0¢ down to 1.0 K. a, b, Li (a) and 'H (b) NMR spectra 
for a magnetic field B=5 T and 2 T, respectively, applied parallel to the 
honeycomb plane (||). No appreciable broadening of the peak is observed 
down to 1.0K, which provides evidence for intact paramagnetism and 
hence a spin-liquid state. The dotted vertical line indicates the spectral 
positions of nuclei without internal fields (K=0). In 7Li NMR, only one 
peak, assigned to Li ions in the honeycomb layer, is observed, indicating 
that no Li ions are left in the interlayer sites after the ion-exchange process. 
The spectra are offset and coloured in various shades for clarity. 


characteristic temperature T. T., at which a knee-like structure is 
observed, increases linearly with B as kgT ~ ppB, where kg is the 
Boltzmann constant. This behaviour can be understood as a suppres- 
sion of low-lying spin excitations with the application of a magnetic 
field over an energy scale of the Zeeman energy, /1pB. We note that K(T) 
does not depend on magnetic field at low temperatures, in marked 
contrast to the strong B dependence of (TT) ~'. The temperature- 
independent and non-zero static susceptibility K(T) at low tempera- 
tures must be dominated by different physics from T}, indicative of the 
presence of a duality in the magnetic response. 

The low-lying spin excitations are identified also in C(T) at low 
temperatures (Fig. 4a). On cooling in B=0, C/T does not approach zero 
and shows a weak power-law increase of C/T x T~/2, which suggests 
the presence of highly degenerate low-lying excitations around energy 
E=0. With the application of B, a peak emerges in C/T at roughly 
T (B)/2, where T’ is the knee temperature in (T;T)~'. Below the peak, 
C/T decreases rapidly with decreasing T, almost linearly with T in the 
low-T limit, consistent with T,. C/T « T is expected for a V-shaped 
density of states D(E) x |E|. The entropy S(T, B) shown in Fig. 4b is 
calculated from C/T and does not depend appreciably on B above T’, 
indicating the weight transfer of the intense low-lying excitations over 
the energy scale of jpB. The field dependence of S(T, B) is linked to 
the temperature dependence of the magnetization M(T, B) through the 
Maxwell relationship (0S/OB);=(OM/0T)p. In M(T, B)/B for the same 
sample, we observe a Curie-like increase in the low-B limit at low tem- 
peratures, which is superposed on the intrinsic and T-independent 
susceptibility K(T) (Extended Data Fig. 2a). The T-dependent part of 
M(T, B)—the Curie-like contribution—is well reproduced from the 
B dependence of S(T, B), which means that the low-lying excitations 
observed in C originate from the Curie-like contribution in M/B. 

We model the observed excitations in terms of an energy-symmetric 
fermionic density of states D(E, B) (Fig. 4c). In zero magnetic field, 
the density of states D(E, 0) = I|E| ~1? (where Tis a constant) has an 
E=0 singularity, giving rise to C/T x T~'?. At a non-zero field B, 
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Figure 3 | The Knight shift and relaxation rate for H3Lilr,O¢ reveal the 
intrinsic static susceptibility and spin dynamics. a, Knight shift K, which 
represents the intrinsic spin susceptibility, for ’Li at fields B=2T and5T 
parallel to the honeycomb plane (||) and 2 T perpendicular to the plane 
(1). No appreciable B dependence of K is observed. K remains non-zero 
and constant from 40 K to below 1 K. b, NMR relaxation rate TG? plotted as 
(T,T)~' as a function of T, measured for B parallel to the honeycomb 
plane. (T;T)~' measures the momentum q-averaged imaginary part of 
susceptibility y”(q, wo)/w at a very low-energy wy, which reflects the 
density of low-energy spin excitations. The almost temperature- 
independent behaviour at low temperatures indicates the presence of a 
non-zero density of low-lying spin excitations. (TT) ~’ is particularly 
suppressed below T° ~ j1gB/kg, which indicates the opening of a B-induced 
excitation gap on an energy scale of the Zeeman energy. The error bars 
indicate the standard deviations in the fitting of the nuclear relaxation with 
a stretched exponential function (see Method). The inset shows a scaling 
plot: T;' versus T/B. The data for 'H (green and light blue) are multiplied 
by a factor of 0.16 to overlap with those for 7Li (blue, dark blue and red). 
The dashed line indicates the T; calculated for ’Li (see Extended Data 

Fig. 5) using the density of states model shown in Fig. 4c. 


the low-energy region below ayupB (|E| < ajpB) (where ais a constant) 
is replaced with the V-shaped function D(E, B) =4I/E|/(appB)*”, 
which represents the excitation gap. Because J’is a constant, the total 
number of states is conserved. We found that, by Fermi-averaging over 
kgT, the simple model D(E, B) with a =2.9 and ’'=4.3 x 108 yl? per 
Ir atom reproduces both C(T, B) and T\(T, B) reasonably well at low 
temperatures, as indicated by the dashed lines in the insets of Figs 3b 
and 4a. Incidentally, the presence of an excitation gap scaled by ajipB 
manifests as a scaling of C, T; and M as a function of T/B (Extended 
Data Figs 2c, 5). 

The NMR spin relaxation rate T;' clearly captures the same 
fermionic excitations as C, with an E=0 singularity and a likely 
particle-hole symmetry, which originates from the Curie-like contri- 
bution in M/B. The absence of the Curie-like contribution in K(T) 
implies that these fermionic excitations are local and produced by 
magnetic defects embedded in the bulk spin liquid, the density of which 
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Figure 4 | Low-lying excitations are captured by the specific heat. 

a, Specific heat C obtained after subtracting the nuclear contribution, 
plotted as C/T versus T for various fields B. The T~"’? dependence of C/T 
at B=0 implies the presence of intense excitations around E=0. With 
the application of B, the low-temperature C/T below 1/2T" © (ugB/kp)/2 
is suppressed, which implies that an excitation gap opens over an energy 
scale of the Zeeman energy. In the T= 0 limit, C/T under B shows linear 
behaviour as a function of T, which suggests the formation of a gap that 
increases linearly with E. The inset shows a scaling plot: CBY?/T versus 
T/B. At low T/B < 0.5, all data points (filled circles) measured at different 
magnetic fields fall onto the universal curve. The dashed line indicates 
CB"?/T calculated on the basis of the density of states model shown in 

c; see also Extended Data Fig. 5). CB"”?/T increases rapidly with T/B and 
deviates from the universal behaviour for large T/B > 1, possibly owing to 
the contribution of the lattice and/or the magnetic excitations in the bulk. 
b, Entropy S(T, B) calculated from the C/T data in a. With increasing T 
(well above T’), S(T, B) recovers to S(T, 0). ¢, Suggested model of 
E-symmetric density of states D(E, B). The dashed black line indicates 

D(E, 0) =IE|~"”?, which weakly diverges at E=0. In a magnetic field B= Bo, 
a B-induced excitation gap, represented by D(E, By) =4I°|E|/(upBo)*”, 
which is linear in E, is formed for |E| < ajipBo to conserve the total number 
of states, as indicated by the solid magenta line. The magenta arrows 
indicate the transfer of states associated with the formation of the gap. 
The states for E < 0 (shaded area) are occupied. ‘and a are constants. 


is estimated to be roughly 1% in our sample. The majority of the nuclear 
spins are spatially distant from the defects and feel only a negligibly 
small static dipole field from them, but they have a finite dipole cou- 
pling with the local excitations, which governs the nuclear relaxation. 
Theoretical studies based on the pure Kitaev model indicate that the 
quantum-spin-liquid state is robust against the presence of a small 
number of spin vacancies and moderate random bond modulations 
and that a trapped Z, flux gives rise to local magnetic excitations”. 
The behaviour of T; at high temperatures (Fig. 3b) provides 
us with information on the interactions among Je= 1/2 moments. 
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The relationship (T,T) | x T~! around room temperature is consistent 
with the existence of thermally fluctuating moments. On lowering the 
temperature below 130K, where K(T) shows a decrease, (TT)! for 
7Li shows a pronounced decrease down to 50 K, whereas (TT)! for 
'H shows a monotonic increase (see also Extended Data Fig. 6a). This 
marked difference between the behaviour for ’Li and 'H comes from 
the form factors. Li atoms are located symmetrically at the centres of 
Jet¢= 1/2 hexagons, where the fluctuating hyperfine fields cancel if anti- 
ferromagnetic correlations of J.¢= 1/2 moments develop over the hexa- 
gon. H atoms are located above the honeycomb layer and near one of the 
Ir-Ir bonds, where similar cancellation would be observed if nearest- 
neighbour antiferromagnetic correlations were to develop. The absence 
of (T;T)~! suppression for 'H therefore implies much reduced antifer- 
romagnetic or possibly ferromagnetic nearest-neighbour correlations 
as compared with the well-developed antiferromagnetic over-hexagon 
correlations. This is consistent with the much-discussed superposi- 
tion of long-ranged antiferromagnetic correlations and Kitaev-type 
nearest-neighbour ferromagnetic interactions®'*®. Note that owing 
to the isotropy of Heisenberg antiferromagnetic interactions, they are 
multiplied by a factor of three in the mean-field calculation of @cw 
for x(T) (ref. 26). This enhancement can explain in part why y(T) in 
Fig. le shows an antiferromagnetic Ocw. 

The data presented here firmly establishes the realization of a 
quantum-spin-liquid state on the honeycomb lattice in H3Lilr,O¢. The 
strong relativistic spin-orbit coupling that is inherent to heavy 5d Ir 
ions and the resultant bond-dependent anisotropic coupling is probably 
the key to the formation of the liquid phase. Recent numerical calcu- 
lations for the pure Kitaev model”*”? show a sharp peak in C with an 
entropy of as large as 50% of kgln(2) that originates from the ordering 
of Z; fluxes at very low temperatures, regardless of whether the system 
is in the gapped or the gapless phase. The observed low-energy excita- 
tions captured by C, however, correspond to an entropy of only 1%-2% 
of kgln(2) at about 5 K (Fig. 4b). Most of the observed entropy can be 
ascribed to local fermionic excitations associated with magnetic defects. 
The entropy that originates from the bulk quantum spin liquid should 
therefore be negligibly small, much less than 1%-2% of kgln(2) at 5K 
(Extended Data Fig. 7). This observation might indicate that additional 
interactions and disorders modify the elementary excitations or even 
the nature of the liquid from the pure Kitaev case. However, we cannot 
exclude completely the possibility that the large-flux entropy of the pure 
Kitaev model is hidden below the lowest temperature that we meas- 
ured (0.05 K; roughly 10~“ times the interaction energy). According 
to numerical calculations”®, flux entropy can be hidden in the Kitaev 
model with non-equivalent Ising bonds. Experiments at ultralow 
temperatures and further theoretical calculations with realistic ani- 
sotropy, additional interactions and magnetic defects are needed to 
identify the putative topological excitations in the quantum spin liquid. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Sample synthesis and characterization. Polycrystalline powder of a-Li,IrO3 was 
prepared as a precursor via a solid-state reaction of LixCO3 and IrO). a-LigIrO3 
powder was sealed in a Teflon-lined steel autoclave with 4M H2SO, aqueous 
solution, and heated at 120°C for 96h. The product was washed with distilled water 
and dried at room temperature. The powder obtained was stable in air. Electrical 
resistivity and magnetic susceptibility above 2 K were measured by using a Physical 
Property Measurement System (PPMS) and a Magnetic Property Measurement 
System (MPMS) (Quantum Design) on cold-pressed pellets. 

Structural analysis. The crystal structure of the product was analysed by using 
powder X-ray diffraction (Extended Data Fig. 1). The diffraction data were 
collected at room temperature with a Stoe Stadi-P transmission diffractometer 
(using a primary beam Johann-type Ge(111) monochromator for Ag Ka, radi- 
ation (A=0.55941 A) and a Mythen-Dectris position-sensitive detector with 
12° 20 opening), with the sample sealed in a glass capillary of 0.5-mm diameter 
(Hilgenberg, glass No. 50). The sample was spun during measurement for better 
particle statistics. Rietveld refinement was performed using the program TOPAS 
version 4.2 (Bruker AXS, 2010). 

The presence of stacking disorder makes the diffraction peak extremely 
asymmetric, which made a precise determination of the detailed structure using 
the conventional technique extremely difficult. We therefore took a two-step 
approach”. First, the ideal structure without stacking faults was obtained by 
eliminating peaks with strong influence of stacking faults in the refinement. The 
structure obtained is shown in Fig. 1b, d and Extended Data Table 1. With the 
replacement of Li with H, the interlayer spacing c= 4.8734(6) A becomes smaller 
than that of a-LipIrO3 (c=5.1219(2) A). This is consistent with the much smaller 
ionic radius of Ht than Lit. O?~ ions in the two adjacent honeycomb layers 
face each other (Fig. 1d), in contrast to the situation in a-LiIrO3 (Fig. 1c). The 
location of the H atoms was not included in the refinement because of the very 
weak scattering from H atoms. However, it is natural that H* ions bridge the two 
facing O?~ ions. Site exchanges between Li and H are not considered. 7Li NUR 
spectra revealed only one Li site that can be assigned to Li atoms in the honeycomb 
layer. This means that all of the interlayer Li ions are replaced with H after the ion 
exchange. Occupation of the Li site in the honeycomb layers by H atoms is very 
unlikely because of the contrast of ionic size. 

The stacking disorder was then treated in the analysis of the powder X-ray 
diffraction by assuming the faulted stacking of finite-thickness lattice slabs of 
ideal structure?>". With a slab thickness of 12 layers, we can reproduce the whole 
powder X-ray diffraction pattern reasonably. We therefore conclude that the crystal 
structure of H3Lilr,Og is essentially as given in Extended Data Table 1, but with 
occasional stacking faults on average every 12 honeycomb layers. 

Specific heat measurements. The specific heat C was measured by using a 
relaxation method in the temperature range 0.3-300 K using a commercial set-up 
(Quantum Design, PPMS), and separately at very low temperatures of 0.05-1K 
using a cell installed in a dilution refrigerator. In the ultralow-temperature meas- 
urement, the sample was wrapped in an annealed silver foil (residual resistivity 
ratio of more than 1,000) and fixed by silver paste to ensure a sufficient thermal 
link between the sample and the sample stage. The data measured with the dilu- 
tion refrigerator were analysed using the relaxation method and a quasi-adiabatic 
method*. The C values obtained by the two methods agree within the experimental 
resolution, confirming the reliability of the C data obtained. The results were 
reproducible for two different samples from the same batch of powder. 

NMR measurements. NMR measurements were carried out using a standard 
coherent pulse spectrometer, cryo-cooled preamplifier, double-rotation probe 
and *He-circulation lines. The NMR frequency-swept spectra were acquired 
using a Fourier-step-sum technique, whereby each Fourier-transformed spectrum 
shifted with its centre frequency is accumulated during the sweep. The quadrupole 
splitting for °’Li nuclei (nuclear spin > 1/2) are negligibly small (less than 20 kHz). 
The oriented powder sample of H3Lilr2O¢ was used for the NMR measurements 
(Extended Data Fig. 8). Preferred orientations were confirmed by the X-ray dif- 
fraction pattern and the NMR spectral shape. The oriented powders show a sizable 
anisotropy in the magnetic susceptibility and Knight shift (Extended Data Fig. 4c). 

The integrated spectral intensity of the NMR line corresponds to about 80% 
of the expected value (Extended Data Fig. 4b). The remaining 20% of nuclei 
seem to be located close to defect centres, where slow spin dynamics breaks the 
coherency of NMR signals (short spin-spin relaxation time T,). Note that the 
observed behaviour is distinct from the 100% ‘wipe out’ effect that has been 
widely encountered in the geometrically frustrated materials SrCrgGa4Oj9 
(ref. 33), NaCrO (ref. 34), NiGa2Sy (ref. 35) and PbCuTe2Og (ref. 36). 

Knight shifts and hyperfine coupling constants. The Knight shift K(T) was 
determined as the position of the NMR peaks in Fig. 2 and represents an intrinsic 
susceptibility. The nuclear gyromagnetic ratios 7,/(27) for 'H, °Li and Li that were 


used to determine the K=0 origins are 42.57639 MHz T~!, 6.265578 MHz T"! 
and 16.54680 MHz T" |, respectively*”. If we subtract the Curie-like term at low 
temperatures in the bulk susceptibility (7), then K(T) for 7Liin Fig. 2 and \(T) 
agree reasonably, which gives an estimate of the nearly isotropic hyperfine coupling 
constant for 7Li of Anr= 0.44 T/jup (Extended Data Fig. 4c, d). This agreement 
confirms that the low-temperature increase in the magnetic susceptibility .(T) 
originates from impurities or defects. K(T) for 'H appears to have similar temper- 
ature dependence to that of ’Li, but the extremely small shift of less than 100 p.p.m. 
(see Fig. 2b) does not allow us to estimate the hyperfine coupling for 'H reliably; 
a rough estimate of App 0.1 T/jug is obtained from a magnetic dipole calculation. 
NMR line width. Clear broadening of NMR peaks has been observed in the 
geometrically frustrated spin liquids reported so far'*'®'”° at low temperatures, 
of the order of 1% of the magnetic coupling J, although no clear peak splitting has 
been identified. The peak broadening in the geometrically frustrated spin liquids 
corresponds to the static distribution of a magnetic moment of (0.01-0.1) jug if 
the peak width is converted to the unit of magnetic moment using the hyperfine 
coupling constant Apy. In the case of H3Lilr,O¢, the broadening in the unit of 
magnetic moment is not zero, but is extremely small (at most 0.002/1g even for 
the locally less-symmetric H sites). This estimate was made on the basis of the 
temperature dependence of the half-width at half-maximum (HWHM) of the 
NMR peak in Extended Data Fig. 4a. The pronounced temperature dependence 
of the HWHM from room temperature down to 40K is ascribed to the demag- 
netization effects. Only the temperature dependence below 40 K (0.2 mT at most) 
reflects the distribution of static moments, which correspond to 0.00045 /1p and 
0.002/1 for 7Li and 'H, respectively. In the conversion to magnetic moment, the 
isotropic hyperfine coupling constant for 7Li Ajp=0.44 T/jug and the mean square 
of the magnetic dipole interaction for 'H |Agip| © 0.1 T/jug are used. It is not realistic 
to ascribe such a small moment to any kind of static magnetism. 

NMR relaxation rate T;. Because the electric quadrupole splitting of ®’Li is small, 
the NMR relaxation rate T, for every nucleus was measured by using a comb- 
shaped-pulse recovery method with an empirical stretched-exponential recovery 
function, 1 — exp[—(t/T,)°]. Ty] for 7Li for B perpendicular to the honeycomb 
plane was up to 50% larger than that for B parallel to the plane (Extended Data 
Fig. 6a), which implies stronger fluctuations of the hyperfine fields along the 
honeycomb plane. The anisotropy of T;' is smaller than the field and the 
temperature dependence and does not affect the main conclusions. 

The exponent (3 for the stretched-exponential recovery function for ’Li is close 
to the ideal value of 1 around room temperature, but decreases gradually to about 
0.6 with decreasing temperature (Extended Data Fig. 6b). The deviation from 1 can 
be ascribed to the spatial distribution of internal states, possibly associated with 
the effect of the specially distributed magnetic defects discussed in the main text. 
30.5, which corresponds to randomized dipolar hyperfine fields, is commonly 
observed in frustrated magnets, including quantum spin liquids. 

Characteristic timescale of spin fluctuations. (T;T)! is the q-averaged imaginary 
part of susceptibility \"(q, wo)/wo: 


tn 


B 


(TT)! = 2kg 


2 " 
A+(g)P x(q, Wo) 
2 ara 


where gis the g-factor. We assume that \(q, wo) takes a Lorentz form with a char- 
acteristic timescale of fluctuation T. In the limit of fast fluctuations as compared 
with NMR frequency (wg < 1/7), (TT) 1 is scaled only by 7 and does not depend 
on wo. However, if extremely slow fluctuations associated with glassy magnetism 
were present (wo >> 1/7), then (T;T)~! « (wo’7) | and soa Wy” dependence of 
(T,T)~! would emerge. To exclude the possibility of such slow spin dynamics 
dominating (T,T)~', we measured Tr for °Li (Extended Data Fig. 6c). We deter- 
mined the ratio of Ty ' for “Li and 7Liat 5T and 10K to be 0.14 + 0.01, in very good 
agreement with the ratio of on for °Li and ’Li of (6.27/16.55)* = 0.144 that is 
expected for the limit of fast fluctuations. This agreement indicates the absence of 
wy dependence. The magnetic-field-induced suppression of (TT)! originates 
from the decrease in the linear slope with respect to wo of \"(q, wo), which corre- 
sponds to the opening of a gap in the spin excitations. 

Subtraction of the nuclear contribution to the specific heat. As shown in 
Extended Data Fig. 9a, the raw specific heat Cjo:/T without any correction for 
B=0,1T and 3T exhibits a sharp T~* increase at very low temperatures on cooling, 
which is expected for a nuclear Schottky contribution. By using the following 
methods, we obtained the specific heat C/T without the nuclear Schottky con- 
tribution shown in Figs 1f and 4. For a low B between 0 T and 3 T, we estimated 
the nuclear Schottky contribution by fitting the low-temperature C,o¢/T with 
Cys/T= AT (broken lines in Extended Data Fig. 9a) and then subtracted Cys/T 
from Cjot/T to obtain the non-nuclear contribution of specific heat C/T. Above 2T, 
taking advantage of the very slow relaxation of nuclear contribution, we obtained 
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the non-nuclear contribution C/T directly by measuring the relaxation with short 
heating (heating only the non-nuclear contribution; see Extended Data Fig. 9b). In 
this high-field range, the relaxation of the nuclear contribution becomes extremely 
slow and is well separated from the fast relaxation from the non-nuclear contribu- 
tion in the time domain. Note that the extremely slow nuclear decay is consistent 
with the rapid suppression of 'H NMR (T;T)! with the application of a magnetic 
field. We checked the consistency of the two types of analysis (at low B and high B) 
at an intermediate B= 3 T. 

Estimate of entropy from the specific heat. The entropy S(T) was estimated by 
integrating C/T. Below the lowest temperature measured, we extrapolate C/T to 
T=0 by using a power-law temperature dependence of T~/? for B=0 and linear 
T under non-zero magnetic fields. 

Data availability. The data that support the findings of this study are available 
from the corresponding author on reasonable request. Source Data for Figs 1, 3 
and 4 are provided with the online version of the paper. 


3t. 


32. 
33. 


34. 


35. 


36. 


37. 


LETTER 


Bette, S., Dinnebier, R. D. & Freyer, D. Structure solution and refinement of 
stacking faulted NiCI(OH). J. Appl. Cryst. 48, 1706-1718 (2015). 

Tsujii, H., Andraka, B., Muttalib, K. A. & Takano, Y. Distributed 72 effect in 
relaxation calorimetry. Physica B 329-333, 1552-1553 (2003). 

Mendels, P. et al. Ga NMR study of the local susceptibility in kagomé-based 
SrCrgGa4019: pseudogap and paramagnetic defects. Phys. Rev. Lett. 85, 
3496-3499 (2000). 
Olariu, A. et a/. Unconventional dynamics in triangular Heisenberg 
antiferromagnet NaCrOz. Phys. Rev. Lett. 97, 167203 (2006). 

Takeya, H. et al. Spin dynamics and spin freezing behavior in the two- 
dimensional antiferromagnet NiGa2S, revealed by Ga-NMR, NQR and SR 
measurements. Phys. Rev. B 77, 054429 (2008). 

Khuntia, P. et a/. Spin liquid state in the 3D frustrated antiferromagnet 
PbCuTes0,¢: NMR and muon spin relaxation studies. Phys. Rev. Lett 116, 
107203 (2016). 

Harris, R. K. et al. Further conventions for NMR shielding and chemical 
shifts (IUPAC recommendations 2008). Pure Appl. Chem. 80, 59-84 
(2008). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


HLilt,O, 


T = 300 K 
C2/m (No. 12), Z=2 
a = 5.3489(8) A, 

b = 9.2431(14) A, 

c = 4.8734(6) A, 

B= 111.440(12)° 


Intensity (arb. units) 


5 10 15 20 
20 (°) 
Extended Data Figure 1 | Powder X-ray diffraction pattern. The indices based on the C2/m space group symmetry are also shown. 
measured powder X-ray diffraction pattern of H3Lilr,O.¢ at room The reflections that are vastly broadened owing to stacking faults are 
temperature was recorded with Ag Ka, radiation; selected reflection indicated in pink. 
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Extended Data Figure 2 | Temperature and magnetic-field dependence 
of magnetization at low temperatures. a, Temperature dependence of 
magnetization M/B under magnetic fields up to B= 14 T for the sample 
used in the specific heat measurements shown in Fig. 4. The Curie-like 
contribution that is absent in K(T) is clearly seen, originating from 
magnetic defects. The dashed line represents a power-law behaviour with 
T \? dependence. The susceptibility in the low-field limit appears to 
follow the T~!/? dependence better than the conventional Curie-Weiss 
dependence. At very low temperatures, the Curie-like contribution 
becomes independent of T for high B fields, implying the saturation of 
moment from magnetic defects. In the inset, M(T) at B= 7.0 T (green 
line) is compared with that calculated by integrating OM(T, B)/OT, 
which we obtained from S(T, B) using the Maxwell relation (red pluses). 
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b, M(B)-B curves at low temperatures. From the offset from the linear 
magnetization at high fields, we estimate the saturation moment that 
originates from magnetic defects to be 0.022 1p (as indicated by the arrow). 
This corresponds to 2% of the magnetic defects with g= 2 and a S= 1/2 
moment. The Zeeman gap in the density-of-states model in Fig. 4 is 2aj1pB 
with a = 2.9, suggesting a g-factor of 5.8 for the moment that comes 

from the magnetic defects. Incorporating g=5.8, the best estimate of the 
number of magnetic defects with S = 1/2 moments is 0.8%. This estimate 
is consistent with the estimate from the specific heat in Fig. 4, which 
indicates an entropy S(T) of 1%-2% of Rln(2) at T=5K, where R is the gas 
constant. c, B!? x OM(T,B)/OT shows a scaling with T/B similar to that for 
Cand T}, indicating that the three probes capture the same excitations. 
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Extended Data Figure 3 | Resistivity versus temperature. The dependence 
of the resistivity p on the insulating temperature T is measured on a 


polycrystalline pellet of H3Lilr.O.. The inset shows the Arrhenius plot of the 
same data, indicating the transport activation energy of approximately 0.12 eV. 
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Extended Data Figure 4 | NMR spectral parameters and estimate of the 
hyperfine coupling constant in H3Lilr,O,. a, HWHM obtained by 
performing a Gaussian fit near the top of each peak. The open symbols are 
for B perpendicular to the honeycomb plane (); the filled symbols are for 
B parallel to the plane (||). The length of the arrow corresponds to a 
hyperfine field at a Li site when a moment of 0.002 1g is placed on the 

Ir atoms. b, The integrated NMR signal intensities after T; and T, 
corrections, which is proportional to the number of nuclei under 
observation. c, Bulk magnetic susceptibility .(T) of aligned powder at 7 T 
after the subtraction of core diamagnetism, with B parallel (purple crosses) 
and perpendicular (green pluses) to the honeycomb plane. The solid lines 
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represent the intrinsic susceptibility y; calculated from x by subtracting 
the Curie contribution that originates from the impurities and/or defects. 
The dotted lines are high-temperature (above 250 K) Curie-Weiss fits, 
which yield a Curie-Weiss temperature and effective moment in a parallel 
field of Ohw =— 64Kand ule = 1.62; per Ir atom, respectively, and ina 
perpendicular field of 9G = — 175 K and lore = 1.72j1, per Ir atom. NMR 
Knight shifts K for Li measured at 2 T (purple triangles and green 
diamonds) are superposed for comparisons. d, K for ’Li plotted against yi. 
The data in a temperature region T > 150 K are well described by the linear 
relation K(T) = [An¢/(Na/p)] yj (solid line), from which we determine the 
isotropic hyperfine coupling constant App=0.44/ jp T. 
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Extended Data Figure 5 | T/B scaling of C and (T,T)~', and their fitting 
with the model density of excitations. B'/*(C/T) versus T/B (filled 
symbols, left axis) and B/?(T,T)~‘? versus T/B (open symbols, right axis) 
under various magnetic fields. All of the C(T, B) data points (closed 
symbols) and T\(T, B) data points (open symbols) fall onto the respective 
universal curves at low T/B, indicating a scaling behaviour. Because 

BYU GT) 31/2 = T,1/(T/B)-1/2, the plot for T; is another way of 
representing the same (T/B) scaling as the inset to Fig. 3b. The physical 
meaning of B"?(T,T)~/” is the Fermi average of D(E) witha 
renormalization factor B"?, which is closely related to B/?(C/T). C/T and 
(TT) ' are given as follows using the standard equations, which express a 
Fermi averaging of D(E) and D(E)’, respectively: 


C1 Pe a ee a 
: 7 J ED) SE ae J PDE) aE 


1 df 
— = — 4rhkg(y,Anr)?_ | D?(E)=-dE 
ap =~ Atha anAne)? [DET 


Here, f(T, E) = 1/{exp[(E/(kgT)] + 1} is Fermi distribution function. 

The solid and dashed lines indicate B'/?(C/T) and B!/?(T,T)~!, 
respectively, calculated using the above equations for the model for 

D(E) shown in Fig. 4c. With a = 2.9 and = 4.3 x 10° J~!? per Ir atom, 
the two scaling curves observed experimentally for B'/(C/T) and 
B‘?(T,T)“? are well reproduced by the calculations. Apr=0.44/j1g T and 
nl 27 = 16.54680 MHz T~! were used as known parameters for 7Li. The 
calculated BY?(C/T) and B’?(T,T)? show different behaviour at high 
T/B (greater than about 1), which reflects the different methods of thermal 
averaging. This difference reasonably accounts for the difference between 
the two universal curves at high T/B that was observed experimentally. 
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Extended Data Figure 6 | NMR relaxation details. a, Ratio of T;' for 'H between the relaxation curves of °Li and ’Li at 10K and 5 T, yielding a ratio 
and 7Li, and anisotropy of T, for 7Li with perpendicular and parallel fields. of T, ' for SLi and ’Li of 0.14. P; is the time period between a saturation 


The error bars are calculated using the errors in the estimate of T (see pulse and the echo sequence in the relaxation measurements. This value 
Fig. 3b). b, Exponent 3 in the stretched-exponential function describing agrees well with the squares of the gyromagnetic ratios. (= 0.6 was used in 
the NMR relaxation curve as a function of temperature. c, Comparison the calculation. 
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Extended Data Figure 7 | Estimate of the intrinsic specific heat C; by 
subtracting the specific heat due to magnetic defects. Fitting the 
specific heat C in Extended Data Fig. 5 with the model for D(E) in Fig. 4c 
gives an estimate of the specific heat due to magnetic defects. The 
best-fitting function, represented dashed line in Extended Data Fig. 5, 

is subtracted from the total specific heat C(B, T) shown in Fig. 4a. The 
residual specific heat AC is almost independent of B and gives a measure 
of intrinsic specific heat C;. AC/T appears to extrapolate almost to zero 
as the temperature approaches 0 K, within the uncertainty of the estimate 
of the contribution from magnetic defects. We do not attempt to separate 
the possible contribution from the bulk spin liquid to AC because of the 
difficulty in estimating the lattice contribution. The inset shows the plot of 
AC/T asa function of T°. 
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Extended Data Figure 8 | NMR spectra of H3Lilr,O¢ before and after 
the alignment of the powder sample. Before the alignment (top), an 
asymmetric line shape in obtained (reflecting the anisotropy in the Knight 
shift Kj versus K,; Fig. 2a), which represents a powder pattern. The middle 
and bottom curves shown the line shapes obtained after the alignment for 
B along the honeycomb plane (the magnetic easy plane) and perpendicular 
to the plane, respectively. Because of the easy-plane anisotropy of 
H3Lilr.Og, the alignment for the direction perpendicular to the plane is 
not as complete as that for the direction parallel to the plane. 
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Extended Data Figure 9 | Subtraction of the nuclear Schottky specific heat C/T at B=3 T obtained by the three different methods agree 
contribution from the raw specific heat data. a, Open circles represent reasonably well, which strongly supports the validity of our analyses. The 
the raw specific heat Cjo,/T at B= 0, 1 T and 3 T. Filled circles represent Cys/T x T~? term for B=0 T is slightly larger than that for B= 1 T. The 
the non-nuclear specific heat C/T obtained after subtracting the nuclear origin of the extra T~* contribution observed in the B=0 data remains 


contribution Cys/T « T~* (dashed lines) from Cjo,/T. Filled triangles and elusive. b, Example of the relaxation curve at B= 3 T and T= 124 mK, 
pluses indicate the non-nuclear C/T obtained directly from a time-domain which indicates the clear separation in the time domain between the fast 


measurement, either by heating for a short period of 100-200 and then relaxation due to the non-nuclear contribution and the slow relaxation due 
measuring the short relaxation for a period of 100-200 s (triangles) or to the nuclear contributions. The inset shows the temperature dependence 
by heating for a long period of 1,500-2,500s and then measuring only of the slow relaxation time constant at B=3 T. 


the short relaxation for a period of 100-200 (pluses). The non-nuclear 
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Extended Data Table 1 | Refined structural parameters of H3Lilr20g at 300K 


Atom site g x y z B (A?) 


Ir(1) 4g 1 
Li(1) 2a 1 
O71) By 1 
O(2) 4j 1 


0 0.335(3) 0 0.1(1) 
0 0 0 0.1(1) 
0.404(8) 0.323(3) 0.2295) 1.7(3) 
0.417(8) 0 0.2209)  1.7(3) 


The space group is C2/m (number 12), Z=2 and the lattice constants are a= 5.3489(8) A, b=9.2431(14) A, c=4.8734(6) A and 
8=111.440(12)°. g and B denote the site occupancy and the isotropic temperature factor, respectively. We did not refine the positions of the 
H atoms because of the small atomic scattering factor. The final R index is Rwp =4.717%. 
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Freezing on a sphere 


Rodrigo E. Guerra!*, Colm P. Kelleher, Andrew D. Hollingsworth! & Paul M. Chaikin! 


The best understood crystal ordering transition is that of two- 
dimensional freezing, which proceeds by the rapid eradication 
of lattice defects as the temperature is lowered below a critical 
threshold'~*. But crystals that assemble on closed surfaces are 
required by topology to have a minimum number of lattice defects, 
called disclinations, that act as conserved topological charges— 
consider the 12 pentagons on a football or the 12 pentamers on a 
viral capsid*®. Moreover, crystals assembled on curved surfaces 
can spontaneously develop additional lattice defects to alleviate 
the stress imposed by the curvature®®. It is therefore unclear how 
crystallization can proceed on a sphere, the simplest curved surface 
on which it is impossible to eliminate such defects. Here we show 
that freezing on the surface of a sphere proceeds by the formation 
of a single, encompassing crystalline ‘continent’, which forces 
defects into 12 isolated ‘seas’ with the same icosahedral symmetry 
as footballs and viruses. We use this broken symmetry—aligning 
the vertices of an icosahedron with the defect seas and unfolding 
the faces onto a plane—to construct a new order parameter that 
reveals the underlying long-range orientational order of the 
lattice. The effects of geometry on crystallization could be taken 
into account in the design of nanometre- and micrometre-scale 
structures in which mobile defects are sequestered into self- 
ordered arrays. Our results may also be relevant in understanding 
the properties and occurrence of natural icosahedral structures 
such as viruses”), 

The remarkable predictions of the Kosterlitz~Thouless—Halperin- 
Nelson-Young (KTHNY)'~* theory of melting on a plane have been 
verified both by experiment and simulation in systems as diverse as 
free electrons on liquid He surfaces", magnetic flux vortices in super- 
conductors'’, and monolayers of paramagnetic colloidal particles 
interacting through induced magnetic dipoles'*"!°. This last system 
is particularly well characterized—its entire phase behaviour can be 
reduced to a one-dimensional phase diagram parameterized by a single, 
dimensionless interaction parameter, I; defined as the ratio of neigh- 
bour magnetic dipole and thermal energies: 


3/2 
re (tp) A 


kaF (1) 


where A quantifies the magnitude of the dipolar pair potential, 
U(r) =A/r (where r is the separation between particles), p is the num- 
ber density of particles, kp is Boltzmann's constant, and T is the absolute 
temperature. Consistent with this definition, the liquid-crystalline 
hexatic phase that separates the isotropic fluid from the crystalline solid 
occupies a narrow window of Ivalues!>, 67 <Is70. 

Our system of charged, hydrophobic microspheres adsorbed at 
oil-aqueous interfaces is accurately described by the same interaction 
potential'®'”. Yet, unlike any system controlled by magnetic fields, 
these particles are bound to the surface by electrostatic image charge 
forces (Fig. 1b), and can uniformly decorate the surfaces of spheres, 
or other curved surfaces, without affecting the form of their inter- 
particle forces (see Supplementary Information section 2). However, 
the topology of the sphere fundamentally changes the KTHNY picture 


of ordering by elimination of defects, since at least twelve 5-coordi- 
nated disclinations (particles with pentagonal Voronoi cells and posi- 
tive topological charge) are required*. Furthermore, it has been shown 
that as the size of the system increases (at constant density) it becomes 
energetically favourable to decorate these 12 disclinations with dislo- 
cations (topologically neutral, 5-7 disclination pairs) organized into 
linear structures called ‘scars’**!*'°. The total number of dislocations 
in the system—proportional to the length of the scars—grows as R/a, 
where R is the radius of the sphere and a is the interparticle distance'®. 
This is unlike other systems, such as superfluid “He (ref. 20) and the 
two-component plasma”, that exhibit a related Kosterlitz-Thouless 
transition on a sphere, but where all topological defects may be elimi- 
nated. Consequently, since dislocations are mobile defects that destroy 
both crystallinity and rigidity*”’, it is natural to ask whether crystalli- 
zation of particles confined to the surface of a sphere is possible at all, 
or whether the proliferation of defects leads to liquid or glassy phases 
even for the strongest interactions (highest J). 

Confocal micrographs (top panel of Fig. 1a) show clear differ- 
ences between droplets with high and low values (for methods see 
Supplementary Information section 1). First, Voronoi tessellations of 
these surfaces show that disclinations (particles with topological charge 
equal to 6 less their coordination number) densely and uniformly cover 
the liquid-like sample, but are much rarer in the more ordered sample 
and are clustered in scars (middle panel of Fig. 1a). A similar pattern 
is shown by the two-dimensional bond-orientational order parameter 
that measures the orientation and degree of hexagonal order around 
each particle*"*: 
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Here, N;is the coordination number of the ith particle, and 0;(t) is the 
angle between the line connecting particle i to its jth neighbour and 
an arbitrary reference axis. Like the distribution of topological defects 
(middle panel of Fig. 1a), the time-averaged |q¢(r3 f)| field, (|W), is 
homogeneous over the entire sphere, as shown in the bottom panel of 
Fig. 1a). For the denser samples, (|2¢|), is spatially heterogeneous—the 
environment around most particles is locally hexagonal for the entire 
duration of observation, while a small number of particles, in or near 
the scars, have very low (|w6|); values. 

KTHNY theory postulates that the magnitude of this orientational 
order—as quantified, for example, by the |1(1;, £)| field averaged over 
space and time, (|¢«|)—is intimately related to the distribution of 
topological defects*. Indeed, in flat space the number of topological 
defects drops precipitously over a narrow range of [values centred 
around I~ 70, while (||) increases rapidly'’. By contrast, while the 
reduction of defects on spheres with N~ 1,500 particles is accompanied 
by increasing values of (|~6|), the crossover to high- "behaviour is 
broader (especially in the simulations) than what we observe in the flat 
layers; moreover, the number of topological defects reaches a plateau 
value much larger than the 12 required by topology (Fig. 1c). In the 
I’,N — oo limit, the number and clustering of defects has been 
predicted using continuum elasticity". 


Center for Soft Matter Research New York University, New York, New York 10003, USA. 
*These authors contributed equally to this work. 
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Figure 1 | Defects and order in spherical assemblies. a, The top panel 
shows particles adsorbed on a sphere at low density (J°= 42, left) 
appearing disordered. More densely packed particles (I’= 145, right) 
exhibit the hexagonal order and lattice lines that are characteristic of flat 
crystals. Scale bars, 201m. The middle panel shows lattice defects 
uniformly covering the disordered sphere (left), whereas the more ordered 
sphere (right) is covered mostly by 6-coordinated particles. The bottom 
panel shows values of the time-averaged bond-orientation order 
parameter, (|¢,(r)|);, of particles on the disordered sphere (left) to be much 
more uniform than those on the higher-/’sphere (right). b, Image charges 
generated by the free ions in the aqueous fluid attract hydrophobic 
poly(methyl methacrylate) (PMMA) particles with charge q and diameter d 


Yet, although these globally averaged quantities reflect the increasing 
order of particles with increasing I, they do not reflect the clustering 
of defects that is evident in the micrographs. To understand this 
clustering better, we compute gs5(r), the pair correlation between 
5-coordinated defects (Supplementary Information section 4), for 
experimental and simulated*-”® particle configurations over a wide 
range of I. We find that gs5(r) is flat for < 70—consistent with a 
random distribution of defects. However, for "> 70, we find a peak in 
&55(r) at short distances that grows and widens with increasing [— 
indicating the condensation of defects into clusters. More interestingly, 
additional peaks in gss(r) appear for values of r that correspond to the 
geodesic distances between the vertices of an icosahedron 
(Fig. 2a), making it possible to draw a football on the sphere such that 
most of the defects lie inside its pentagons (Fig. 2b). Nevertheless, this 
icosahedral ordering of defects develops gradually, and it is difficult to 
unambiguously identify isolated defect clusters until > 120. 

The strong coupling between lattice defects and particle mobility in 
crystalline solids!*”* suggests that the proliferation and spatial segre- 
gation of dislocations will also affect the dynamics of our system. To 
study this we adopt a tool used to describe glasses, labelling particles 
that move more than a distance \" over a time 7 as ‘mobile, Q=1, 
and those that do not as ‘caged; Q= 0 (Supplementary Information 
section 6)*°”’. For large values of I we observe clustering of mobile 
particles reminiscent of the clustering of topological defects (Fig. 2c). 
To investigate the connection between defects and mobility we compute 
gap(r), the pair correlation function between mobile particles (Q= 1) 
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to the oil-aqueous interface and induce dipolar repulsive forces between 
them. c, Measurements of defect fractions (top) and (|4s|) (bottom) for 
spheres decorated with N = 1,500 particles and for similar particles on flat 
surfaces. Symbols are experimental data points, with error bars defined in 
Supplementary Information section 2.2, while solid lines are simulation 
results. Flat surface packings eliminate nearly all defects and attain nearly 
perfect orientational order over a narrow range of I'values (67 S I'S 70, 
shaded). For spherical packings, the improvement in local order and 
annihilation of defects with increasing [is dramatically hindered, with a 
finite concentration of defects expected even in the ’— co limit; this is 
consistent with predictions based on continuum elasticity (dashed 
horizontal line). 


and disclinations (D, defined as particles with any coordination number 
other than six). We find that ggp(r) is almost identical to gs5(r), con- 
firming that particle mobility is strongly heterogeneous and becomes 
confined to the same icosahedrally coordinated ‘seas’ that contain the 
excess lattice defects. 

In two dimensions, long-range correlations of orientational order, 
captured by g(r) = (We(x)W¢(y))x—y|=r are a clear sign of crystallinity’; 
however, vector transport on the sphere changes angles and complicates 
the definition of a global reference coordinate system. Nevertheless, 
the icosahedral ordering of defects suggests that it may be possible to 
detect crystalline order by explicitly referencing this broken symmetry. 
We thus define an icosahedral ‘net’ by rotating an icosahedron so that 
its vertices are aligned with the positions of the defects (Supplementary 
Information section 8). Projecting the particles onto the faces of the 
icosahedron and unfolding the structure onto a plane reveals the 
remarkable global orientational coherence of particle configurations 
with high J” (Fig. 3a). We quantify this order using the icosahedrally 
referenced orientational correlation function: 


ger) = (Welridbe*(r{)) cosMrery/R2)=r (3) 


where w(r;) is the icosahedrally referenced value of the bond- 
orientational order parameter (Fig. 3b, Supplementary Information 
section 8.2). We note that randomly oriented icosahedral nets, or nets 
based on polyhedra with different symmetries, will not produce such 
coherence (Supplementary Information section 8.5). 
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Figure 2 | Emergence of icosahedrally coordinated defect ‘seas’. a, Pair 
correlation of 5-coordinated disclinations, gss, for experimental (top) 
and simulated (bottom) spheres decorated with N ~ 1,500 particles. 
Disordered spheres are uniformly randomly covered with lattice defects, 
leading to a flat and featureless g;;. As order increases so does a coherent 
defect-density modulation with peaks at 0, 6), 02 and 1, consistent with 
their segregation into icosahedrally coordinated ‘seas. b, Rendering 

of experimentally determined 5- and 7-fold coordinated lattice defect 
positions over the surface of a suitably oriented football*° makes the 


The emergence of this new broken symmetry now raises the question 
of whether the extreme retardation in the growth of (||) is a finite-size 
effect, or arises because the spherical geometry fundamentally changes 
the form of the freezing transition, perhaps requiring an additional 
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Figure 3 | Icosahedral map projections and long-range order. a, The left 
panel shows a reference icosahedron, oriented by fitting its vertices to the 
positions of the defect particles. The right panel shows that unfolding this 
icosahedron reveals long-range order hidden by the extra disclinations. 

b, Orientational correlation functions, gi(r), computed using the reference 
icosahedra (see text and Supplementary Information section 8). For the 
samples at low J; orientational correlations decay rapidly, whereas at high I, 
hexagonal order extends across the entire sphere. At intermediate values of 
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icosahedral ordering apparent. c, Distribution of ‘mobile’ particles (red, 
see text) over the surface of spheres with [= 42 and P= 145. After a 

lag time 7, mobile particles appear randomly over the surface of the 
disordered sphere (left panels) and quickly cover its surface. Particles on 
the more ordered sphere (right panels) have far lower absolute mobilities 
(‘caged’ particles are white), and the mobile particles that appear are 
segregated into isolated ‘seas. d, Pair cross-correlations of all defects and 
mobile particles for the same spheres shown in a show similar ordering to 
&s55. demonstrating the spatial coincidence of defects and mobility. 


order parameter to describe it. Consequently, to explore the N — 00 
limit, we supplement our simulations of spheres decorated with with 
N=1,500 particles, with simulations with N = 3,000, 6,000 and 12,000 
particles*”°, and evaluate two order parameters: the icosahedrally 


b Experiment 


Geodesic distance (rad) 


I, the absence of well defined defect clusters means that the correlation of 
the best-fit icosahedron to the orientational order is low, so many 
independent configurations must be sampled to produce accurate g(r) 
curves. Since our experiments sample only a small number of independent 
configurations, the correlation functions in the top panel of b in the range 
85 S I'S 110 sometimes reach negative values at large r, whereas 
simulation results in the bottom panel are consistent with thorough 
sampling. 
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Figure 4 | Icosahedral order and finite size scaling. a, Scaling of the 
normalized, three-body, three-dimensional bond-orientation of the 
defects, We, with and system size for simulated particle configurations. 
Icosahedral order grows approximately linearly beyond a critical value 

of I, where the slope of linear fits to the data (dotted lines) appears to 
diverge as N — oo. b, Scaling of the average icosahedrally referenced bond 
order parameter, (|W|*)"/?, with and system size. The rate at which this 
order parameter grows increases with system size, and its N — 00 
extrapolated values remain positive for all J’ that correspond to the 
flat-space solid. 


referenced two-dimensional bond-orientation of the particles 
Y= = Li wé(r;) and the normalized, three-body, three-dimensional 
bond-< orientation of the defects?® We: 
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where the first quantity in the summand is the Wigner 3-j symbol and 
Pom = VER YX; Yom(9is $;) (Yom are the sixth-order spherical harmonics, 
and (@, 0;) are the polar coordinates of the ith defect). This order 
parameter is particularly sensitive to the presence of icosahedral sym- 
metry, and is normalized so that We= fora perfect icosahedron. 

Plots of these quantities show that increasing the system size 
increases the rate at which orientational and icosahedral order increase 
with increasing I" (Fig. 4a, b). A polynomial extrapolation of (|W)? )!/? 
in powers of 1/R« 1/ aN shows that, in the thermodynamic (N — oo) 
limit, (|Wé|?)!/? vanishes for ’< 67, but remains finite for > 70, 
coinciding with the two-dimensional liquid and crystal phase bound- 
aries. Similarly, Wg is zero for lower I, and begins to increase roughly 
linearly—with a slope approximately proportional to N—above a 
critical threshold. We note that both of these order parameters 
(\wél?)1/? and W, exhibit transitions close to the I’values of the liquid- 
hexatic and hexatic-solid transitions in flat space as R — oo. 


(4) 


m,m2,M3=—6 
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The elucidation of order by the unravelling and flattening of an 
appropriate polyhedral net may be useful in other contexts, such as 
liquid crystalline orientations on curved surfaces (for example, the 
tetrahedrally symmetric, baseball-like texture of a nematic confined 
to a thin shell, Supplementary Information section 8.6”). As in our 
study, the need for both crystalline and polyhedral order parameters 
has been invoked in recent models of assembly of icosahedral viral 
motifs'®. Moreover, the segregation of defects to symmetric sites, 
and the concomitant mobility near these sites, should prove useful 
in designing structures where both rigidity and fluidity are desired 
in specific areas. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 


Data Availability Source Data for Figs 1-4 are provided with the online version of the 
paper. Coordinates and trajectories of particles identified from confocal micrographs 
for every experimental sample referred to in the paper are available from the Open 
Science Framework repository ‘Freezing on a Sphere’?!. The same repository 
contains a snapshot of every simulated dataset referred to in the paper (each stored 
as an XML file) and corresponding Python scripts, which, together with the HOOMD- 
blue plugins included in Supplementary Information, can be used to reconstitute 
all of the simulation data presented here. Raw confocal image data and complete 
simulation results are available upon request. 


Code Availability Numerical simulations were performed using HOOMD-blue v1.3.3 
(refs 23, 24) with double-precision arithmetic and custom plug-ins for pairwise 
interaction potentials and Langevin dynamics integrators. Experimental and 
simulated datasets were analysed using software written in MATLAB, CUDA, and C. 
These analysis routines and the HOOMD plug-ins are available in Supplementary 
Information. 
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Southern Hemisphere climate variability forced by 


Northern Hemisphere ice- 


sheet topography 


T. R. Jones', W. H. G. Roberts?, E. J. Steig?, K. M. Cuffey*, B. R. Markle? & J. W. C. White® 


The presence of large Northern Hemisphere ice sheets and reduced 
greenhouse gas concentrations during the Last Glacial Maximum 
fundamentally altered global ocean-atmosphere climate dynamics’. 
Model simulations and palaeoclimate records suggest that glacial 
boundary conditions affected the El Nifio-Southern Oscillation”, 
a dominant source of short-term global climate variability. Yet little 
is known about changes in short-term climate variability at mid- to 
high latitudes. Here we use a high-resolution water isotope record 
from West Antarctica to demonstrate that interannual to decadal 
climate variability at high southern latitudes was almost twice as large 
at the Last Glacial Maximum as during the ensuing Holocene epoch 
(the past 11,700 years). Climate model simulations indicate that 
this increased variability reflects an increase in the teleconnection 
strength between the tropical Pacific and West Antarctica, owing to 
a shift in the mean location of tropical convection. This shift, in turn, 
can be attributed to the influence of topography and albedo of the 
North American ice sheets on atmospheric circulation. As the planet 
deglaciated, the largest and most abrupt decline in teleconnection 
strength occurred between approximately 16,000 years and 15,000 
years ago, followed by a slower decline into the early Holocene. 

During glacial—interglacial transitions, the coupled ocean—atmosphere 
system shifts between stable states, driven by large climate forcings, 
including Milankovitch orbital cycles, greenhouse gas concentrations, 
and the decay of continental ice sheets!. The state changes are observed 
in palaeoclimate proxy archives such as ice cores at high latitudes, and 
lake sediments and speleothems in the tropics and mid-latitudes. 
The state of the tropical Pacific is particularly important because this 
region affects the generation and communication of climate anomalies 
between the low and high latitudes*”. 

Whether the nature of the El Nifilo-Southern Oscillation (ENSO) 
changed between the Last Glacial Maximum (LGM) and the Holocene 
is an important question*®. Proxy studies of ENSO variability during 
the LGM are contradictory”®, and the results of modelling studies 
are similarly ambiguous”. One obstacle to understanding past ENSO 
variability is disentangling changes in ENSO itself from changes in its 
influence on climate outside the tropics, that is, the strength of ENSO- 
related teleconnections, which can vary over time’. 

Tropical Pacific climate variability is an important driver of interan- 
nual to multidecadal climate variability in West Antarctica*°. Warm sea 
surface temperatures (SSTs) in the tropical Pacific drive the propaga- 
tion of atmospheric Rossby waves towards the high latitudes, affecting 
weather systems that develop over the Amundsen Sea*?. Variability 
in the Amundsen Sea region dominates climate variability over the 
adjacent West Antarctic Ice Sheet (WAIS)>!™!!, Over longer timescales, 
the influence of tropical climate variability on the Amundsen Sea may 
have implications both for the stability of the ice sheet!” and for the 
ventilation of carbon dioxide from the Southern Ocean!’, 

The influence of tropical variability on the climate of West Antarctica 
is reflected in ice core records from the WAIS. In particular, the oxygen 


and hydrogen isotope ratios of water are sensitive to temperature 
and atmospheric circulation anomalies associated with large ENSO 
events'“. Such records might therefore yield important insights about 
changes in tropical climate variability and tropical-to-extratropical 
teleconnections during glacial—interglacial transitions. In this study, 
we use a high-resolution hydrogen isotope (SD) record from the WAIS 
Divide ice core (WDC) to investigate about 31,000 years (31 kyr) of 
high-frequency climatic variability. The data were produced using 
laser absorption spectroscopy coupled with a continuous flow analysis 
system!°. The record is the longest continuously measured water 
isotope record ever recovered, with an effective sample resolution of 
5mm anda maximum difference in age between consecutive samples 
of 0.33 years (Extended Data Fig. le). 

We use spectral techniques (see Methods) on the WDC 8D record 
to analyse West Antarctic climate variability (Fig. 1). Periods longer 
than 3 yr are preserved throughout the 31-kyr record (Extended Data 
Fig. 1a). To account for attenuation of the 6D signal by diffusion in 
the firn column and solid ice’®, we apply a diffusion correction on 
consecutive 500-yr data windows (see Methods; Extended Data 
Figs 1b, 2c). We isolate the amplitude of 3-7-yr and 4-15-yr 6D 
variations by averaging power density over these frequency ranges. 
The 3-7-yr range is typical of a high-frequency spectral peak seen 
in observations of the Southern Oscillation Index (Extended Data 
Fig. 1c), while the 4-15-yr range is less affected by diffusion 
(Extended Data Fig. 1a) and captures the decadal variability observed 
in the Southern Oscillation Index (Extended Data Fig. 1c), tropical 
Pacific coral isotope records’’, and in modern observations in West 
Antarctica? 4. 

We find that the mean amplitudes of the diffusion-corrected 
variability in the 3-7-yr and 4—15-yr bands are elevated in the Late 
Glacial (16-31 kyr ago) period relative to the Holocene (0-10 kyr ago) 
(Fig. 1c, d). An approximately 50% decrease in amplitude occurs from 
about 16 kyr to 10 kyr ago in both time series. This decline occurs in 
two steps: an abrupt decrease about 16 kyr ago followed by a second 
but less pronounced decline during the period 13-10 kyr ago. Neither 
time series reveal other substantial changes in variability during the 
Holocene or during the Late Glacial. We use an objective algorithm (see 
Methods) to detect an initial statistically significant (P < 0.05) decline 
in the diffusion-corrected 4-15-yr time series based on 500-yr data 
windows (Extended Data Fig. 2a, b). We find that the decline began 
16.44 + 0.30 kyr ago (all ages are given relative to the present, 1950 ap). 
Identical results are obtained using the 3-7-yr time series. The timing 
of the initial decline is robust among different detection techniques. 
Using a subset of 100-yr data windows for the diffusion-corrected 
4-15-yr time series (Extended Data Fig. 2c, d), the decline occurs 
16.24 +0.17 kyr ago. The timing of the decline is not an artefact of the 
diffusion correction. Using the raw data in the 4-15-yr band for 500-yr 
data windows (Fig. 1d), an approximately 40% decrease in amplitude 
occurs from about 16 kyr to 10 kyr ago, with an initial decline occurring 
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Figure 1 | WDC high-frequency signal strength. a, The raw, high- 
resolution WDC 8D water isotope record (grey) and a 15-yr average (red). 
b, Power density ratio for 15 kyr before and after the primary shift in WDC 
variability (that is, 16-31 kyr ago relative to 0-15 kyr ago); raw data in grey 
and diffusion-corrected data in orange. For centennial periodicities, the 
mean ratio is 1.3, whereas the mean ratio for raw and diffusion-corrected 
periods for the 4-15-yr band is 1.9 and 2.5, respectively. The blue and 


15.94 +£0.30 kyr ago. Conservatively, this places the decline in WDC 
variability between 16.74 kyr to 15.64 kyr ago, with a central estimate 
of 16.24 kyr ago. 

The climate shift about 16 kyr ago coincides with large-scale geo- 
logic events observed in proxy records across the globe. A substantial 
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green intervals show 3-7-yr and 4-15-yr variability, respectively. The raw 
data ratio is lower than one at <3 yr owing to increased mean diffusion 

in the glacial period relative to the Holocene’®. Periods >4 yr are not 
substantially influenced by diffusion (Extended Data Fig. 1a). c, d, Plots 
of diffusion-corrected relative amplitudes using 500-yr data windows, 
normalized to the most recent value, for 3-7-yr variability (c) and 4-15-yr 
variability (d). Dashed lines show 1o uncertainties (see Methods). 


lowering of the Laurentide Ice Sheet surface is estimated to have 
occurred 16 kyr ago (from C dating)'®, corresponding to the Heinrich 
Event 1 iceberg discharge event in the Northern Hemisphere (Fig. 2). 
The Heinrich Event 1 ice-rafted debris is of Laurentide origin”, indicat- 
ing abrupt changes in ice mass and the topography of the Laurentide Ice 


Figure 2 | Indicators of oceanic and 
atmospheric variability. a, Tropical Pacific- 
West Antarctic teleconnection strength from 
HadCM3 (see Fig. 3). The open red dots 
indicate when the teleconnection strength 

is 95% significantly different from the pre- 
industrial period using a two-tailed Student's 
t-test. b, WDC diffusion-corrected 4-15-yr 
variability; dashed lines are 1o uncertainties (see 
Methods). c, d, Hydrologic variability recorded 
in an Indonesian lake sediment core”! (c) and 
Bornean stalagmites”° (d). e, Relative sea level 
(solid line) and confidence interval (dashed 
lines)**, with the estimated duration of melt 
water pulse (MWP) 1A”. f, The NGRIP ice core 
8!80 record”’ from central Greenland, showing 
the Younger Dryas (YD), Bolling-Allerod (BA), 
and Dansgaard-Oeschger (DO) events 2-4. The 
dashed black line represents the maximum about 
16 kyr ago in North Atlantic ice-rafted debris, 
corresponding to a massive freshwater discharge 
during Heinrich Event 1'*. The grey block is 

the estimated duration of Heinrich Event 1”. 
The black arrow is the estimated timing of the 
flooding of the Sunda Shelf?!>*4. 
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Sheet. The Cordilleran ice sheet also declined rapidly at this time’, and 
the lowering of the combined Laurentide-Cordilleran ice sheets (LCIS) 
has been suggested as a possible forcing mechanism for the transition 
from dry to wet conditions in Indonesia”! (Fig. 2c). Taken together, the 
similarity in timing of these various records suggests a common link. 

To investigate the mechanisms responsible for changing high- 
frequency variability at WDC, we use the HadCM3 fully coupled 
ocean-atmosphere model to simulate climate in thousand-year steps 
from 28 kyr to 0 kyr ago”* (see Methods). We assess variability in 
both the Amundsen Sea region and the tropical Pacific Ocean. Since 
the isotopic composition of ice at WDC is indicative of atmospheric 
circulation in the Amundsen Sea region" (Fig. 3a), we examine the mid- 
level flow in the model, represented by the 500-hPa geopotential height 
field. The amplitude of the simulated variability in the mean 500-hPa 
geopotential height field (Zas,) of the Amundsen Sea Low (ASL) 
region (55°-70° S, 195°-240° E) follows the same trend as the obser- 
vations from WDC: more variability during the LGM, less variability 
during the Holocene, and a decline in variability 16 kyr ago (Fig. 4a). 
We examine the tropical Pacific influence (Zpa-) on the modelled 
Zasy by linearly removing the effect of the basin-wide (150°-270° E, 
5° N-5° S) tropical Pacific SST (SST;,,.) from the Zs, time series 
(Fig. 4e, f). The amplitude of the residual ASL variability (Ziocai) is 
essentially constant over the past 28 kyr (Fig. 4a), demonstrating that 
the change in the ASL is connected to tropical variability. 

The increased Zag, variability at the LGM could be the result of 
either increased tropical variability or a change in efficiency of the 
teleconnection between the tropics and the high latitudes, forced 
by a change in the tropical mean state. To separate these processes, 
we compute the strength of the teleconnection between the tropics 
and ASL by constructing composite maps. These composites 
show the average response of the 500-hPa geopotential height 
field to ENSO events in the tropical Pacific (see Methods). We 
use a composite that imposes both an upper and lower limit 
on the size of SSTpac (£1.25 to 2.5 times the pre-industrial 
standard deviation) to eliminate the possible influence of stronger or 
weaker ENSO events during the glacial period. Composites with or 
without this upper limit are indistinguishable. The results demonstrate 
that, in our HadCM3 experiments, the change in Zags, variability is 
the result of a more efficient teleconnection, rather than a change in 
ENSO variability (see Methods). A time series of the strength of the 
simulated teleconnection over the past 28 kyr (Figs 2a, 4b) shows that 
the teleconnection is stronger through the LGM until 16 kyr ago, at 
which time it rapidly decreases. This is in agreement with the modelled 
change in variability in Zs; and the observed timing of change in the 
WDC record. A change in the LGM teleconnection strength is also 
evident in eight out of eleven of the models of the Paleoclimate Model 
Intercomparison Project (PMIP2/3) (Extended Data Fig. 7). This is 
remarkable given the diversity of the response of the tropical climate 
of these models to LGM forcing”’. 

The stronger teleconnection in the glacial period in our simulations 
suggests a change in the tropical Pacific mean state. Such a change could 
result from the lowered greenhouse gas concentrations, the altered 
orbital forcing, or the presence of continental ice sheets. In HadCM3, 
we test each of these boundary conditions individually to evaluate their 
impact (see Methods). The greenhouse gas concentrations and orbital 
forcing cause a negligible change, whereas the influence of continental ice 
sheets is statistically significant (P < 0.05). Two processes associated with 
continental glaciation may be important for changing the teleconnection: 
the lowering of sea level, leading to the exposure of the shelf seas in the 
West Pacific”4, and the topographic and albedo forcing of the LCIS. 
When considered separately, the lowered sea level causes a small change 
in the teleconnection that is statistically significant (P< 0.05) only to the 
north of the ASL region (Fig. 3e). Since flooding of West Pacific shelf seas 
is thought to have occurred about 13 kyr ago*’, this may be a possible 
cause of the decline in WDC variability between about 13-10 kyr ago, 
but cannot explain the larger step change about 16 kyr ago. 
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Figure 3 | HadCM3 teleconnection strength. a, Map of the correlation 
coefficient R? for the 500-hPa geopotential height (Z500) and WAIS 
Divide 8D, using the isotope-enabled HadCM3 model for the pre- 
industrial period. Contours are filled (not white) when the statistical 
significance exceeds 95%. b-e, Composite maps of the annual average 
500-hPa height anomaly (AZ500) for ENSO events, including the 
pre-industrial (b), and the differences between three different model 
experiments and the pre-industrial (c—e). Contours are filled (non- 
white) when statistical significance exceeds 95% using a Monte Carlo 
test. The green box is the Amundsen Sea region over which we calculate 
the teleconnection strength from the mean 500-hPa height anomaly. The 
purple square is the WAIS Divide ice core site. The blue box in b indicates 
the area shown in c-e. The contour interval is 5m in b and 2.5m ine. 
Negative contours are dashed. 


The topography and albedo of the LCIS force a large change to the 
teleconnection strength about 16 kyr ago, which is statistically signifi- 
cant (P < 0.05) across the ASL region (Fig. 3d). This timing is consistent 
with major deglacial changes to the LCIS'®. To separate the effects of 
topography and albedo, we perform two tests. First, using pre-industrial 
settings, we introduce the LGM-LCIS albedo and topographic effects 
into the model. In this case we find that almost the full LGM telecon- 
nection strength change is realized (Fig. 3d). Second, we remove the 
LGM-LCIS topographic effect, while keeping the albedo effect. Here, 
the teleconnection pattern changes, but is not statistically significant in 
the ASL region. Our results therefore demonstrate a link between the 
topography of the LCIS and multi-year to decadal climate variations 
in West Antarctica—a previously undocumented inter-polar telecon- 
nection mechanism. 

When the LCIS surface is high, the circulation in the North Pacific 
changes, with a persistent annual mean Aleutian Low that is deeper 
and farther south, a response that is consistent amongst climate 
models”°. This weakens the mean winds in the West Pacific, causing 
the warm pool to expand eastward, as occurs during an El Nifio event. 
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Figure 4 | HadCM3 mechanistic attribution. a, The variance of the 
height of the 500-hPa pressure surface (Z500) for the Zasr (solid line) 
and Zioca (dashed line) indices, computed from monthly mean data that 
is filtered with a 4-15-yr band pass filter. The markers are unfilled if the 
variance of Zags, at each time slice is 95% significantly different (F-test) 
to that of 0 kyr ago. At 21 kyr ago, the orange triangle shows Zs, for 
the ‘21ka LCIS only’ scenario, and the green triangle shows Zasr for the 
‘21ka Shelf Exp’ scenario (that is, the effect of sea level change in the 
Maritime Continent region). Each of these scenarios is defined in 
Supplementary Data. b, The modelled teleconnection strength between 
the tropical Pacific and West Antarctica (solid red line; open red dots 
indicate statistically significant differences from pre-industrial ). c, The 
simulated difference in mean annual rainfall between the central Pacific 
and the Maritime Continent (average of 20° N-20° S, 145°-190° E minus 


The atmospheric convection moves with the warmer waters away 
from the Maritime Continent (the part of Southeast Asia comprising 
Indonesia, the Philippines, Papua New Guinea and nearby countries) 
(Fig. 4c); hence, rainfall decreases in this region, consistent with the 
LGM drying inferred from lake sediments in Indonesia”! (Fig. 2c) and 
stalagmites in Borneo”® (Fig. 2d). The shifted convection then causes 
a change in the location of the atmospheric heating that occurs during 
ENSO events. Changing the location of this heating alters the structure 
of the extra-tropical Rossby waves that are excited during ENSO events, 
so that, when the LCIS is present, there are additional circulation anom- 
alies in the Southern Hemisphere high latitudes. Although there are 
large changes in how the Rossby waves are forced from the tropics, there 
is no change in how they propagate within the Southern Hemisphere. 
Since Rossby waves are the primary mechanism responsible for the 
tropical Pacific-West Antarctic teleconnection*», the LCIS can force 
a change in West Antarctic climate variability. Hence, when an ENSO 
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average of 20° N-20° S, 100°-145° E), shown as an anomaly relative to the 
pre-industrial period (light blue line). Also shown are the precipitation 
anomalies over the Maritime Continent only (dashed grey line). 

d, Northern Hemisphere ice area*® (dashed green line) and average height 
of the LCIS*® (blue line). In all panels, the dashed vertical line shows 16 kyr 
ago, as in Fig. 2. e, Map of the change in the 500-hPa height field variance 
between 21 kyr ago and the pre-industrial period. f, As in e, except for the 
Zpac Variability that is linearly related to ENSO (that is, the part removed 
from Zasr to yield Zjoca1). The green box is the Amundsen Sea region over 
which we calculate the indices in a. The purple square is the WAIS Divide 
ice core site. The contour interval is 40 m’, with colours changing every 

80 m?. Negative contours are dashed. Colours are plotted for 95% statistical 
significance using a Monte Carlo test. 


event occurred during the LGM—even if it were no bigger than during 
the modern era—it had a larger impact on West Antarctic climate than 
during the modern era. 

Our results identify an influence of Northern Hemisphere ice-sheet 
topography on the climate system. By altering the coupled ocean- 
atmosphere circulation, the decay of the LCIS affected the strength 
of interactions between the tropical Pacific and high southern lati- 
tudes, reducing interannual and decadal variability in West Antarctica 
by nearly half. Although several abrupt climate events occur during 
the past 31 kyr, including Dansgaard—Oeschger events, the Bolling- 
Allerod, the Younger Dryas and melt water pulse 1A2427 (Fig. 2e, f), 
the interannual and decadal climate variability in West Antarctica and 
the patterns of rainfall in the tropical West Pacific do not appear to be 
affected. Instead, the initial reduction in West Antarctic variability at 
about 16 kyr ago corresponds to a maximum in North Atlantic ice- 
rafted debris layers during Heinrich Event 1'*. It was this ice-sheet purge 
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that probably reduced LCIS topography beyond a critical threshold, 
altering the interhemispheric climate dynamics of the Pacific basin, 
even as separate abrupt climate events continued to occur in the 
Atlantic basin and further afield. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Water isotope data. The WDC water isotope record (Fig. la) was analysed on a 
continuous flow analysis system!° using a cavity ring-down spectroscopy instru- 
ment (Picarro Inc. model L2130-i). The data are reported in delta notation relative 
to Vienna Standard Mean Ocean Water (VSMOW, 6!8O = &D = 0%o0), normalized 
to the Standard Light Antarctic Precipitation (880 = —55.5%o0, 8D = —428.0%o) 
scale. WDC is annually dated, with accuracy better than 0.5% of the age in the 
range 0-12 kyr ago, and better than 1% of the age in the range 12-31 kyr ago*)?. 
Frequency domain analyses. Spectral conversion. We use the MultiTaper method 
fast Fourier transform technique to calculate spectral power densities**** of the 
measured water isotope time series. As in other palaeoclimate studies*, we use the 
pmtm.m routine of P. Huybers (http://www.people.fas.harvard.edu/~phuybers/ 
Mfiles/). Before spectral analysis, the isotope data are linearly interpolated at a 
uniform time interval of 0.05 yr. 

High-frequency signal attenuation. High-frequency water isotope information in ice 
cores is attenuated by diffusion in the firn column and deep ice!®?*-*°, Frequency 
spectra reveal the amount of signal attenuation as declines in the amplitude of 
a given frequency through time, relative to lower frequencies (Extended Data 
Fig. la). For WDC, the annual signal (1 yr) is indiscernible at ages > 14 kyr ago. The 
2-yr signal is indistinguishable from 17-19 kyr ago. Signals >3 yr are detectable 
throughout the past 31 kyr, while signals >4 yr are not substantially attenuated by 
diffusion (Extended Data Fig. 1a). 

Gaussian determination of diffusion lengths. The quantitative effects of diffusion 
can be represented by the convolution of a Gaussian filter with the original 
water-isotope signal deposited at the surface and subsequently strained 
by ice deformation and firn compaction***? (Extended Data Fig. 1b). The power 
density spectrum observed in the ice core record, P(f), after diffusion, is 
P(f) = Bf) exp[—(2nfo)?], where P,(f) represents the power spectrum of the 
undiffused signal, f= 1/2 is the frequency, . is the signal wavelength, z is the depth, 
and a; is the diffusion length. Fitting a Gaussian to P(f) defines a standard deviation 
of with units of per metre. The conversion 0, = 1/2n./2 x 1/oy then yields the 
diffusion length cin units of metres!®. The diffusion length expressed in units of 
time is 0; = 02/Xayg, Where Aqvg is the mean annual layer thickness (in metres per 
year) at a given depth (Extended Data Figs If, 2c). The diffusion length quantifies 
the statistical vertical displacement of water molecules from their original position 
in the ice sheet. In the present study, we use our previously published WDC 
diffusion lengths, calculated for 500-yr data windows throughout the interval 
0-29 kyr ago!®, 

Natural logarithm determination of diffusion length. The variable ofcan also be deter- 
mined by using the slope of the linear regression of In[P(f)] versus f*. This provides 


a means of estimating diffusion-length uncertainty’®. Here, of=f1/[2|(7mmn)|] > 


where mp is the slope of the linear regression over the interval from 0.01 (cycles per 
metre)” to the value at which systematic noise from the ice core analysis system 
overwhelms the physical signal. The point where noise dominates appears as a 
‘kink or ‘bend’ in the decay of In[P(f)]. A maximum and minimum slope is fitted 
within the standard deviation of the linear regression to determine an uncertainty 
range for oy. 

Power density diffusion correction. Diffusion of a water isotope signal in an ice sheet 
reduces the power of high-frequencies, so P(f) takes the form of quasi-red noise. 
Given that WDC Holocene spectra show constant power density in the frequencies 
largely unperturbed by diffusion (periods >4 yr), we use a white-noise normali- 
zation to estimate the original, pre-diffusion power density spectrum. Specifically, 
we calculate R,(f) = P exp(4n7f? o; ), where P is the observed power density (in 
units of %o? yr), fthe frequency (in units of per year), and 0; is the diffusion length 
(in units of years). We report the average power density for the diffusion-corrected 
3-7-yr, 4-15-yr, and 3-30-yr bands (Fig. 1c, d, Extended Data Fig. 1d) calculated 
as the integral of power density divided by the frequency range. The uncertainty 
on these power density diffusion corrections is determined using the uncertainty 
range for diffusion lengths discussed in the section ‘Natural logarithm determina- 
tion of diffusion length. 

Change point detection. We fit linear regressions to windows of data within the 
3-7-yr and 4-15-yr relative amplitude time series, using window sizes of 2,500 yr 
to 10,000 yr. We calculate the P value for the F-test to determine whether the 
slope of the regression in each window is statistically different from zero 
(Extended Data Fig. 2b). We make no a priori assumptions about the timing or 
size of statistically significant change and take an a posteriori confidence level 
equivalent to 95%:a = 1 — 0.95!/", where n is the number of statistical test real- 
izations*’. The first statistically significant change occurs 16,440 yr ago for the 
4-15-yr diffusion-corrected data (all ages are given relative to the present, 
1950 ap). The 3-7-yr diffusion-corrected data yields identical results. The 
first statistically significant change in the 4—15-yr raw data (not corrected for 
diffusion) occurs 15,940 yr ago. 


Uncertainty in the timing of initial change includes the spectral window resolu- 
tion of 500 + 250 yr and the WDC14 age scale (1% for ages >12 kyr ago”). For 
the initial change centred at 16.4 kyr ago and 15.9 kyr ago, the age scale imposes an 
uncertainty of +164 yr and +159 yr, respectively. Adding the above in quadrature 
yields initial timing uncertainties of just less than +300 yr. We also estimate the 
change point visually using a subset of 4-15-yr diffusion-corrected data, calculated 
using 100-yr non-overlapping windows between 16.64 kyr ago and 15.54 kyr ago. 
The initial decline occurs 16.24 kyr ago, with uncertainty on the window length 
of £50 yr and a dating uncertainty of +162 yr. Added in quadrature, this gives an 
uncertainty value of +170 yr. 

HadCM3 model simulations. Model setup. We use the fully coupled ocean- 
atmosphere model HadCM3*!”. This model has been shown to simulate the cli- 
mate in the tropical Pacific very well, including in its response to glacial forcing”. 
We simulate the climate over the past 28 kyr in a series of snapshots run every 
thousand years”. For each snapshot, we prescribe the orbital forcing’’, green- 
house gas concentration****, and ice-sheet topography and sea level*”. We use 
a suite of simulations to test the sensitivity of the model to individual boundary 
conditions, including ‘Full 28-0ka, ‘Full 21ka, ‘21ka Orbit + GHG’ (where GHG 
is greenhouse gas), ‘21ka Ice Sheets, ‘21ka LCIS’ (where LCIS is the combined 
Laurentide—-Cordilleran ice sheets), ‘21ka Shelf Exp? and ‘21ka LCIS albedo. Each 
of these simulations is defined in Supplementary Data. All simulations are run 
for at least 500 yr with analysis made on the final 200 yr of each simulation. (The 
term ‘ka refers to ‘kyr ago.) 

Long-term evolution of ASL variability. The variability in the WDC water isotope 
record on interannual and longer timescales is related to changes in the large- 
scale atmospheric circulation, particularly in the Amundsen Sea Basin" (Fig. 3a). 
These changes in the circulation can be characterized by the ASL***”. To assess 
the amount of variability in the ASL, we compute an ASL index of monthly mean 
500-hPa height averaged from 55°-70° S and 195°-240° E (Zasr). We use a 4-15-yr 
band pass filter to isolate variability on the same timescale as the WDC record. 
Extended Data Fig. 3 shows the variance of Zas;, within each snapshot of a thou- 
sand years, using the ‘Full 28-Oka simulations. From 28 kyr until around 15 kyr 
ago, the ASL is more variable than during the Holocene (95% significant, F-test). 
Both the magnitude and timing of the decrease are comparable to those observed 
in the WDC water isotope record. 

Internal versus forced variability of ASL. To assess how much of the change in the 
simulated Zasr is due to the tropics, we linearly remove the average of SST across 
5° N-5° S and 150°-270° E (SSTpac) from the Zs. This operation is performed by 
regressing Zags; Onto SSThac, as shown by the equation Zpac = SSTpac X regress(SSThac, 
Zast). We then subtract Zpac from Zags; to obtain the component of Zag, that is 
unrelated to the tropical Pacific: Zjocai=Zast — Zpac- The variance of Zjocal is plotted 
in Extended Data Fig. 3. The results show that there is no statistically significant 
change in the ASL variability that is unrelated to tropical Pacific SST over the 
deglaciation (95% significance, F-test). We compare these results, obtained with the 
4-15-yr band pass filtered data of monthly mean model output, to the annual mean 
data with no filtering (Extended Data Fig. 3a,b). We obtain very similar results for 
both outputs. The tests described below use the annual mean data. 

The tropical Pacific-to-ASL teleconnection. The tropical Pacific-to-ASL telecon- 
nection has been well documented**“”. To show this teleconnection in HadCM3, 
we construct composite maps of 500-hPa height for years when the annual SSTpac 
anomaly exceeds +0.81 °C (Extended Data Fig. 4). This threshold is chosen so that 
SSTpac anomalies exceed 1.25 standard deviations in a pre-industrial simulation. 
We include cold ENSO events by multiplying the 500-hPa height field by (— 1). 
Note that computing SST, : over the entire tropical Pacific basin eliminates biases 
that may arise from changes in the pattern of SST variability, either in response to 
forcing or as a result of intrinsic ENSO variability. 

The pre-industrial composite pattern that we obtain for the atmospheric 
response to tropical Pacific SST shows greatest amplitude in the Amundsen Sea 
region, with a positive height anomaly for warm ENSO events (Extended Data 
Fig. 4a). A similar pattern is obtained from reanalysis data using either regression 
analysis“* or compositing**°°. The composite is statistically significant across the 
Southern Hemisphere. 

Using HadCM3, we find a similar composite pattern for the pre-industrial 
period (Extended Data Fig. 4a) and for the ‘Full 21ka@ simulation (Extended Data 
Fig. 4b). The difference between the two patterns is shown in Extended Data Fig. 4c; 
a Rossby-wave response emanating from the western tropical Pacific is evident. 
This response is consistent with other studies that show the tropical Pacific-to- 
ASL teleconnection*>*!, Using each snapshot in the ‘Full 28-0ka’ simulations, 
we evaluate the amplitude of the tropical Pacific-to-ASL teleconnection as the 
mean of the composite averaged over 55°-70° S and 195°-240° E (Extended Data 
Fig. 5). There is a dramatic decrease in the strength of the teleconnection around 
16 kyr ago. 
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Teleconnection changes resulting from ENSO changes using HadCM3. We find that 
the simulated variance of SST}: anomalies 21 kyr ago (= 0.78 °C) is increased by 
40% compared to the pre-industrial period (c =0.65°C). To determine the effect 
of SST variability on the teleconnection strength, we re-compute the composites 
using SSTpac anomalies in the range -£0.81°C < SSTpac < +£1.63 °C, corresponding 
to SSTpac at 0 kyr ago of + 1.250 to + 2.50. By imposing both an upper and lower 
bound, the atmospheric response in the 500-hPa height composites is unaffected 
by the amplitude of simulated ENSO events. Furthermore, any change in the fre- 
quency of ENSO events is rendered unimportant by taking an average composite 
of events. 

The composite patterns for all simulations from 28 kyr to 0 kyr ago, as well as 
how they differ from the pre-industrial period, are unchanged by imposing the 
upper and lower bound, while the amplitude of the composites is slightly reduced. 
For the ‘Full 21ka simulation, the average amplitude of the Zag, is 32m without 
the upper bound, and 30 m with the upper bound imposed (Extended Data Fig. 6). 
Therefore, even though HadCM3 simulates an increase in the strength of ENSO 
during the LGM, this is not the primary cause for the change in Zag, variability. 
Rather, the increased Zag, variability occurs because the atmospheric response to 
SSTpac during the glacial period is stronger than during the pre-industrial period. 
Teleconnection changes resulting from ENSO changes using the PMIP. For a subset 
of PMIP2/3 simulations (those for which sufficient data was available*”), we follow 
the same procedure as outlined above to construct composites of 500-hPa height 
response to SST,,- anomalies. We include both the lower and upper SST)ac bounds. 
The composites are computed for the ‘Pre-industrial’ and ‘Full 21ka simulations 
(Extended Data Fig. 7). A change in the tropical Pacific-to-ASL teleconnection is 
apparent in many of the models. Since each model has a different teleconnection 
in the pre-industrial, the response of the models to boundary conditions 21 kyr 
ago is also different. 

In the LGM simulations, PMIP2/3 models with enhanced SST): variance do not 

display a consistently different teleconnection compared to models with reduced 
SSTpac variance. This confirms that changes in the tropical Pacific-to-ASL tele- 
connection strength are not driven by changes in the amplitude of SST,, anoma- 
lies (that is, ENSO). Indeed, in comparison to HadCM3, the most similar change 
in the teleconnection occurs in FGOALS-1.0g simulations, part of the PMIP2/3 
model suite. FGOALS-1.0g shows a large decrease in ENSO strength at 21 kyr 
ago, compared to the HadCM3 increase. The FGOALS-1.0g positive geopotential 
height anomaly is shifted eastward, consistent with the location of the regional 
low-pressure centre in the pre-industrial simulations. 
Boundary condition changes responsible for the teleconnection change. In ‘Full 21ka 
simulations, a number of the climate forcings are substantially changed compared 
to the pre-industrial period. These include insolation, the greenhouse gas concen- 
trations, and the size of the ice sheets (Extended Data Fig. 8a). We investigate the 
impact of each of these boundary condition changes on the teleconnection. The 
‘21ka Orbit + GHG’ simulation does not cause a significant change (Extended 
Data Fig. 8b). In the ‘21ka Ice Sheets’ simulation, which includes reduced sea 
level, enhanced albedo in the higher latitudes, and altered topography over North 
America, Europe, Antarctica, and the Southern Andes, there is a large change 
in the amplitude of the teleconnection (Extended Data Fig. 8c). Prior modelling 
simulations have explored other aspects of the topographic effects of Northern 
Hemisphere ice sheets”, including their influence on glacial climate™’, abrupt 
glacial climate change”, the Atlantic Meridional Overturning Circulation”®, and 
Heinrich events””. 

The ‘21ka LCIS only’ simulation displays a statistically significant (P < 0.05) 
change in the teleconnection strength, similar in magnitude to the ‘Full 21ka’ simu- 
lation (Extended Data Fig. 8e). The ‘21ka LCIS albedo’ simulation (that is, albedo 
only) displays a statistically significant change in the teleconnection, but its effect is 
much smaller than that of topography and albedo combined (Extended Data Fig. 8f). 
Although we cannot isolate the effect of topography alone, our results suggest 
that topography is the dominant mechanism acting to change the teleconnection 
strength. This agrees with the WDC shift in variability about 16 kyr ago, which is 
concurrent with the abrupt lowering of the LCIS™. 

It has been proposed that lowered sea level in the West Pacific at the LGM 
can have a large impact on tropical Pacific climate by exposing the continental 
shelves”*°8, We test this by changing only the land-sea mask in the West Pacific in 
the ‘21ka Shelf Exp’ simulation (Extended Data Fig. 8d). This results in a change to 
Zasz that is 3 to 4 times smaller than that seen in the ‘21ka LCIS only’ simulation. 
There is also a statistically significant change in the composite map only to 
the north of the ASL. In the WDC diffusion-corrected variability (3-7-yr and 
4-15-yr bands), there appears to be a step-change that occurs at about 13 kyr ago. 
This timing is consistent with the flooding of the continental shelves in the West 
Pacific****. Therefore, it is possible that the WDC data are detecting flooding of 
the continental shelves, but this effect is secondary to the abrupt change at approxi- 


mately 16 kyr ago. 


LETTER 


Climatic changes associated with the teleconnection change. The tropical Pacific- 
to-ASL teleconnection can be considered as a Rossby Wave response to diabatic 
heating in the western tropical Pacific®. The location of this diabatic heating will 
depend upon the mean state. To understand how the mean state changes, we ana- 
lyse how the patterns of precipitation and SST change in the ‘21ka simulations 
relative to the pre-industrial period. Extended Data Fig. 9a shows the precipitation 
difference between the ‘Full 21ka’ simulation and the ‘Pre-industrial’ simulation. 
There is a prominent decrease in precipitation over the Maritime Continent, and 
an increase in precipitation over the Indian Ocean and the western and central 
tropical Pacific. This is consistent with proxy evidence”!>°!. In Extended Data 
Fig. 9b, the ‘21ka Ice Sheets’ simulation accounts for the majority of the precipi- 
tation changes in the ‘Full 21ka simulation. When we decompose this effect, the 
‘21ka LCIS only’ simulation accounts for most of the precipitation changes in the 
western and central tropical Pacific (Extended Data Fig. 9c), while the ‘21ka Shelf 
Exp: simulation accounts for most of the precipitation changes over the Maritime 
Continent and in the Indian Ocean (Extended Data Fig. 9d). We find that there is a 
much larger change in West Pacific diabatic heating (as shown by the precipitation 
field) in the ‘21ka LCIS only’ simulation than in the ‘21ka Shelf Exp’ simulation. 
Since the ASL responds to diabatic heating anomalies in the West Pacific, rather 
than the Indian Ocean, this is consistent with our interpretation that the LCIS 
dominates the tropical Pacific-to-ASL teleconnection strength change. The timing 
of the change in the precipitation pattern is shown in Fig. 4c. Around 16 kyr ago, 
the location of the tropical precipitation moves westward from its glacial position 
to its pre-industrial position in the far West Pacific. 

The location of diabatic heating is ultimately a response to the winds and SST. 

Extended Data Fig. 9 shows that the SST patterns for the ‘Full 21ka, ‘21ka Ice 
Sheets, and ‘21ka LCIS only’ simulations are all similar. Although the global mean 
SST is much cooler in the ‘Full 21ka simulation (Extended Data Fig. 9a), it is the 
spatial structure of the SST that determines the climatic response. Therefore it is 
not surprising that all three simulations alter the teleconnection in a similar way. 
Furthermore, the SST pattern is consistent across most of the LGM PMIP2/3 simu- 
lations, indicating a robust SST response to an ice sheet. The SST pattern in the ‘Full 
21k@ simulation is associated with cyclonic flow in the North Pacific that is forced 
by the presence of the LCIS. Prior modelling has shown that this wind response to 
the LCIS could result from a summertime weakening of the sub-tropical high and 
a wintertime deepening of the Aleutian Low™. 
Data availability. The WDC water isotope data that support the findings of this 
study are available at http://gcmd.gsfc.nasa.gov/search/Metadata.do?entry= 
NSF-ANT10-43167#metadata. Additional supporting data, including model 
results, are provided in Supplementary Data and Extended Data Figs 1-9. 
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Extended Data Figure 1 | Signal detection. a, Relative amplitudes 

for 1-yr, 2-yr, 3-yr and 4-yr periods calculated for 500-yr spectral data 
windows, normalized to the value for the annual signal in the most recent 
data window. Both the climate and diffusion affects the amplitude of 
these high-frequency signals. However, it is mainly the effects of diffusion 
that cause the loss of the annual signal at >14 kyr ago. Similarly, the 2-yr 
period is lost between 17 kyr ago and 19 kyr ago, when the rate of diffusion 
for WDC was highest!®. Periods >3 yr survive diffusion throughout the 
past 31 kyr. b, An example of a deconvolution calculation showing the 
observed 6D water isotope record (that is, raw data; dotted red line), and 
the diffusion-corrected record (black line). Although this calculation 

was not used for the data presented in this paper, it serves as a visual 

aid in understanding how the diffusion-corrected water isotope record 
would look in the time domain. We performed all diffusion-correction 
calculations in the frequency domain to reduce uncertainty. c, The power 
density spectrum for the Southern Oscillation Index from 1951 to 2017 
(black; 95% confidence intervals in grey), compared with a red noise null 


hypothesis (red) calculated from the average of 100 power spectrums 

of synthetic data that have the same autocorrelation and variance as 

the Southern Oscillation Index. The Southern Oscillation Index has 
power greater than the red noise across a broad spectral peak between 2 
and 17 yr, which can be subdivided into a 2-7-yr high-frequency peak 
and an 8-17-yr peak. Owing to the limited temporal span of modern 
observations, multi-decadal spectral estimates of the Southern Oscillation 
Index cannot be adequately defined. d, Diffusion-corrected relative 
amplitudes using 500-yr windows of WDC water isotope data. e, The 
difference in age of consecutive 5-mm WDC water isotope samples (blue) 
and a 500-yr sliding average (red). f, The WDC accumulation rate®’, 
inverted. The accumulation (in metres of ice equivalent per year), and by 
extension the difference in age of consecutive 5-mm WDC water isotope 
samples in e, undergoes large changes during the deglaciation at around 
18.5 kyr ago, occurring 2.5 kyr before the change point in teleconnection 
strength at about 16 kyr ago. 
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Extended Data Figure 2 | Change point detection. a, WDC 4-15-yr on 15.54 kyr ago; raw data (black), diffusion-corrected data (blue), 
variability for 500-yr data windows (dashed lines are 1o uncertainties; see Gaussian fit (red) with dashed 1o uncertainty bounds (see Methods). 
Methods). b, Regression test algorithm to determine the first significant The same calculation is made for 500-yr data windows. d, The subset 
change in the WDC 4-15-yr variability in a. The first coloured data point of diffusion-corrected 4-15-yr amplitudes (green) calculated at 100-yr 
below the P-value significance threshold occurs 16.44 kyr ago. c, Example resolution. The values are normalized to the amplitude value at 16.24 kyr 
of the diffusion-correction calculation for a 100-yr data window centred ago, which represents the change point towards smaller amplitudes. 
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Extended Data Figure 3 | ASL variability. Panels a and b show the 
variance o7 of the indices Zagy, (black) and Zigca (blue). a, The indices are 
computed from monthly mean data, then filtered with a 4-15-yr band pass 
filter. b, The indices are computed from annual mean output. We compute 
Zas. as the mean 500-hPa height in the region 55°-70° S and 195°-240° E. 
The blue lines show Zag, after linearly removing the SST), time series 
from the 500-hPa height field; this is the ASL variability unrelated to 

the tropical Pacific (Zjoca; see Methods). The markers are unfilled if the 
variance of Zag, at each time slice is 95% significantly different (F-test) 
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to that of the pre-industrial period. c, Map of the change in the variance 
of the 500-hPa height field between 21 kyr ago and the pre-industrial 
period. d, The variability that is linearly related to ENSO, Zpac (this is the 
part removed from Zag, to yield Zjocai). €, The variability with the effect 

of ENSO linearly removed (see Methods; this the equivalent of Zjocal). 
Changes not attributable to ENSO occur to the north of the Amundsen 
Sea, while changes over the Amundsen Sea are related to ENSO. In c-e the 
contour intervals are 40 m?, with colours changing every 80 m”. Negative 


contours are dashed. 
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Extended Data Figure 4 | Composites of the 500-hPa height field for 
ENSO events. a—c, Composite for the ‘Pre-industrial’ (a), ‘Full 21ka (b) 
and the difference between the ‘Full 21ka and ‘Pre-industrial’ simulations 
(c). Contours are filled (not white) when statistical significance exceeds 


95% using a Monte Carlo test. Negative contours are dashed. In a and b 

the contours are plotted every 5m and colours saturate at £25 m. The thin 
blue box shows where the ASL index is computed. In ¢ contours are plotted 
every 2.5m and colours saturate at +12.5 m. 
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values are derived from composites constructed using only the lower limits 
on the size of ENSO events (see main text and Methods). b, The t-score 

is indicated by the red line. Values outside the shaded red region are 95% 
statistically different from the pre-industrial value. 


Extended Data Figure 5 | Tropical Pacific-to-ASL teleconnection 
strength, computed using the ‘Full 28-O0ka’ simulations. a, Average 
500-hPa geopotential height anomaly in the Amundsen Sea region, 
computed within the purple box shown in Extended Data Fig. 4. These 
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Extended Data Figure 6 | Composites of the 500-hPa height field for 
ENSO events with and without upper bound. a, The teleconnection 
without upper limit using the ‘Full 21ka simulation. b, The teleconnection 
with upper limit using the ‘Full 21ka’ simulation. c, The difference in 
teleconnection between the ‘Full 21ka simulation and the ‘Pre-industrial’ 
simulation without upper limit. d, The same difference as ¢ with upper 


wey | if 
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limit. In all panels, negative contours are dashed, and contours are filled 
(not white) when statistical significance exceeds 95%using a Monte Carlo 
test. In a and b, the contours are plotted every 5 m and colours saturate at 
+25 m. Inc and d, contours are plotted every 2.5 m and colours saturate at 
+12.5m. 
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(a) FGOALS-1.0g. (b) IPSL-CM4-V1-MR. (c) CCSM3. 


(g) MRI-CGCMs. (h) MIROC-ESM. 


(j) IPSL-CM5A-LR. 


Extended Data Figure 7 | Composite maps of the 500-hPa height field 
for ENSO events in the PMIP2/3 models. For each model in a-k, the 

top panel shows the ‘Pre-industrial’ composite and the bottom panel the 
difference between the ‘Full 21ka’ and ‘Pre-industrial’ simulations. In the 
bottom panel, contours are filled (non-white) when statistical significance 
exceeds 95% using a Monte Carlo test. In the top panels, contours are 


(k) COSMOS-ASO. 


plotted at 5-m intervals, with colours saturating at 25 m. In the bottom 
panels, contours are plotted at 2.5-m intervals, with colours saturating 

at +12.5 m. Negative contours are dashed. The reduced statistical 
significance in these panels compared to those shown in Extended Data 
Figs 4, 6, 8 and 9 is due to the shorter data series available in the PMIP2/3 
archives. 
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Extended Data Figure 8 | Sensitivity of composite maps to different exceeds 95% using a Monte Carlo test. a, ‘Full 21ka’ simulation. b, ‘21ka 
sets of boundary conditions 21 kyr ago. All plots show the difference Orbit + GHG’ simulation. c, ‘21ka Ice Sheets’ simulation. d, ‘21ka Shelf 
between 21-kyr and 0-kyr composites. The composites are constructed Exp: simulation. e, ‘21ka LCIS only’ simulation. f, ‘21ka LCIS albedo’ 
using both upper and lower limits on the size of ENSO events. Contours simulation. Each of these simulations is fully defined in Supplementary 


are plotted every 2.5 m (negative contours are dashed) and colours saturate _—_ Data. 
at +12.5m. Contours are filled (not white) when statistical significance 
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Extended Data Figure 9 | Annual mean anomalies of precipitation and 
sea surface temperature. Maps of anomalies 21 kyr ago relative to the pre- 
industrial period. a, ‘Full 21ka simulation. b, ‘21ka Ice Sheets’ simulation. 
c, ‘21ka LCIS only’ simulation. d, ‘21ka Shelf Exp’ simulation. Annual 
means are calculated from 100 years of output. Contour intervals for 
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precipitation are 1 mm per day, and for sea surface temperature are 0.5 °C. 
Land areas are shown in grey. Note that the temperature colour scale in 

a ranges from —4°C to 0°C. This accounts for the mean greenhouse gas 
cooling that is seen in the ‘Full 21ka simulation. 
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Global patterns of declining temperature variability 
from the Last Glacial Maximum to the Holocene 


Kira Rehfeld!*, Thomas Miinch!?, Sze Ling Ho!* & Thomas Laepple! 


Changes in climate variability are as important for society to address 
as are changes in mean climate!. Contrasting temperature variability 
during the Last Glacial Maximum and the Holocene can provide 
insights into the relationship between the mean state of the climate 
and its variability”*. However, although glacial-interglacial changes 
in variability have been quantified for Greenland’, a global view 
remains elusive. Here we use a network of marine and terrestrial 
temperature proxies to show that temperature variability decreased 
globally by a factor of four as the climate warmed by 3-8 degrees 
Celsius from the Last Glacial Maximum (around 21,000 years ago) to 
the Holocene epoch (the past 11,500 years). This decrease had a clear 
zonal pattern, with little change in the tropics (by a factor of only 
1.6-2.8) and greater change in the mid-latitudes of both hemispheres 
(by a factor of 3.3-14). By contrast, Greenland ice-core records show 
a reduction in temperature variability by a factor of 73, suggesting 
influences beyond local temperature or a decoupling of atmospheric 
and global surface temperature variability for Greenland. The 
overall pattern of reduced variability can be explained by changes 
in the meridional temperature gradient, a mechanism that points 
to further decreases in temperature variability in a warmer future. 

There is scientific consensus that the mean global temperature has 
been rising over the instrumental era*. However, whether this warming 
has caused surface temperatures to become more’ or less®” variable, 
and how this variability will change in a warmer future, remain topics 
of debate. Here we use palaeoclimate proxy data to quantify changes 
in temperature variability before and after the last major transition 
in global mean climate: the 3-8 °C warming® from the Last Glacial 
Maximum (LGM, around 21,000 years (21 kyr) ago) to the current 
warm period of the Holocene (Fig. 1). The magnitude of temperature 
change during this transition is in the same range as that projected for 
the coming centuries’. 

The global spatial pattern of the mean LGM-to-Holocene tem- 
perature change has been established through numerous studies*!”, 
However, except for some studies on changes in interannual climate 
variability in the tropics'!, our current understanding of changes in 
variability is largely based on the stable oxygen isotope records of the 
high-resolution central Greenland ice cores”. The isotope records, 
which are interpreted as a proxy for temperature!3, show that the last 
glacial period appears to have been not only cold but also highly var- 
iable on decadal to millennial timescales». This finding is not lim- 
ited to the magnitude of distinct events, such as the Heinrich stadials 
(that is, cold periods in Greenland) or the abrupt transitions into the 
Dansgaard-Oeschger interstadials; it also holds for the background 
variability during the LGM (Fig. 1b). 

Consequently, glacial climate has been characterized as highly 
variable", whereas the Holocene is commonly described as a stable 
and quiescent period’. The large reduction in variability was proposed 
to have supported human dispersal throughout Europe" and cultural 
evolution!®. However, the evidence for an exclusively stable Holocene 
climate—beyond that of Greenland ice-core records—is unclear, 


particularly because other proxy records for temperature in and outside 
of Greenland suggest considerable variability during the Holocene!™””. 

In this study, we derive a quantitative estimate for global and regional 
change in temperature variability between the LGM (27-19 kyr ago) 
and the Holocene (8-0 kyr ago) based on high-resolution palaeoclimate 
proxy records for temperature (Fig. 1a). These time periods represent 
rather stable boundary conditions with minimal changes in ice-sheet 
size and sea level. Furthermore, our LGM time window contains only 
one small Dansgaard-Oeschger event, thereby enabling us to focus 
our analysis on the glacial background state. We compile two global 
datasets (Methods). The first (‘joint’) dataset contains 28 records that 
cover both the LGM and the Holocene. We estimate the change in 
variability from the ratio of LGM to Holocene variance separately for 
each record and thus independently of calibration uncertainties, as long 
as the calibrations are constant over time. This is a reasonable assump- 
tion because state-dependent calibrations have been proposed only 
for Greenlandic ice cores'’, and we take this into account. Analysing 
variance ratios from single cores also minimizes site-specific effects on 
the variance estimates, such as the ecological preferences of the organ- 
isms that record the climate signal or the extent of bioturbation, which 
affects marine proxies in sediments (Methods). The second (‘separate’) 
dataset is more extensive, containing 88 records for the Holocene and 
39 for the LGM. In this case, we first derive zonal mean estimates of 
temperature variability for each time slice and then form the ratio. All 
proxy types for which multiple calibrations exist were recalibrated using 
a single temperature relationship for each proxy type and region. For 
both the joint and the separate dataset, we quantify the change in varia- 
bility as the ratio of variance at timescales between 500 and 1,750 years 
in the spectral domain using a method that is insensitive to changes 
in the temporal sampling. We correct the ratio for the effects of non- 
climate variability in the proxy records using independent estimates of 
the signal-to-noise ratio of the proxies (Methods). 

All three Greenlandic ice-core records display large changes in vari- 
ability, with an average LGM-to-Holocene variance ratio R = Vigm/ Vhol 
of 73 (90% confidence interval of 50-112; Fig. 2a). In contrast to this 
marked reduction, the area-weighted average change in variability 
for the rest of the globe is far lower: the separate estimate indicates 
a decrease in variability by a factor of 7.0 (90% confidence interval, 
2.2-16). The large uncertainty range is due to the combination of many 
different proxy records which are affected by potential site-specific 
effects such as differing seasonal responses. The magnitude of change 
is confirmed by the joint dataset, which offers a more precise estimate 
of R=4.4 (90% confidence interval, 2.5-6.6) by circumventing these 
complications. Together, these datasets suggest a significantly lower 
(P<0.01) change in variability outside of Greenland than is found in 
Greenlandic ice-core records. The discrepancy also cannot be recon- 
ciled by considering a potentially lower quality of marine-based tem- 
perature reconstructions (Methods). This observation suggests that 
Greenlandic ice-core records should not be used as a sole reference for 
climate variability, particularly concerning the amplitude of change. 


1Alfred Wegener Institute Helmholtz Centre for Polar and Marine Research, Telegrafenberg A43, 14473 Potsdam, Germany. @British Antarctic Survey, High Cross, Madingley Road, Cambridge CB3 
OET, UK. SInstitute of Physics and Astronomy, University of Potsdam, Karl-Liebknecht-Strasse 24-25, 14476 Potsdam, Germany. “University of Bergen and Bjerknes Centre for Climate Research, 


Allégaten 41, 5007 Bergen, Norway. 
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Figure 1 | Proxy records for temperature. a, Site locations (symbols) and 
mean LGM-to-Holocene temperature change (Tigm — Thos background), 
estimated from climate model and proxy data’. The pre-industrial 

(Ap 1850) temperature is used as a surrogate for the Holocene time 

slice because we are interested in only the first-order pattern of the 
deglaciation. b, North Greenland Ice Core Project (NGRIP) ice-core 6'8O 


The spatial pattern of variability change (Fig. 2b-d) shows a 
distinct latitude dependence (Fig. 3a). A small, yet statistically 
significant, change can be found in the tropics (20° S-20° N; R=2.1; 
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Figure 2 | Global LGM-to-Holocene change in variability and 
temperature gradient. a, Distribution of the globally averaged area- 
weighted LGM-to-Holocene variance ratio (Vigm/Vhos without Greenland; 
red denotes the joint dataset, orange the separate dataset) and the regional 
Greenland variance ratio (black). Note that for visibility the Greenland 
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data for Greenland (ref. 12; black, expressed in %o with respect to Vienna 
Standard Mean Ocean Water) with millennial trend (blue) and bandpass- 
filtered temperature (AT; 0.5-1.75 kyr}; red) for Holocene and LGM 
(grey lines in background show the full record). c, Sea-surface temperature 
(SST) inferred from Mg/Ca ratios from tropical marine sediment record 
$0189-39KL*; colours as in b. 


90% confidence interval, 1.6-2.8). The mid-latitudes (20-50° S and 
20-50° N) show a moderate decrease in variability from the glacial 
period to the Holocene by a factor of 5.4 (90% confidence interval, 
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density estimates are on a separate y axis. b-d, LGM-to-Holocene proxy- 
derived variance ratios (symbols, bottom colour scale) and modelled 
change in temperature gradient’ (background, right colour scale; details in 
Methods) for the globe (b), Greenland (c) and Antarctica (d). 
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Figure 3 | Latitudinal structure of LGM-to-Holocene variability and mean 
changes. a, Zonal-mean change in variability from the proxy compilations 
(red bars denote the joint estimate, orange points the separate estimate). 

b, Latitude dependence of the equator-to-pole change in temperature 
gradient. The five-point smoothed zonal-mean change in gradient (black 
line) is shown together with the change in gradient at the proxy locations 
(black squares), compared to the individual proxy estimates of the change in 
variability (red circles). Red and green shading denotes the 90% confidence 
interval of the global-mean change in variance without Greenland and of the 
mean change in variance for Greenland, respectively. c, Zonal-mean change 
in temperature’. All error bars are 90% confidence intervals. 
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3.3-10) and 11 (90% confidence interval, 8.0-14), respectively. The 
polar regions (50-90° S and 50-90° N) are represented by only 
Greenlandic and Antarctic ice-core records and reveal an asymmetric 
pattern: the Greenland change is the highest globally, whereas 
Antarctica displays only a small change (R=2.5; 90% confidence inter- 
val, 2.0-3.2), comparable to that in the tropical ocean. West Antarctic 
ice cores show a stronger variability change than do ice cores from East 
Antarctica (Fig. 2d), a finding that is similar to the west-east contrast 
in the response to anthropogenic forcing!”. The estimated pattern of 
variability change is similar for multicentennial and millennial time- 
scales (Extended Data Fig. 1), which demonstrates that our finding is 
not limited to one specific frequency band. This result further suggests 
that the Dansgaard—Oeschger event included in the LGM time slice has 
only a minor influence. 

The equator-to-pole surface air temperature gradient in the LGM was 
larger than in the Holocene, because the high latitudes have warmed 
more than the tropics since the LGM!° (Figs 1a, 2b). Furthermore, 
the land-sea contrast in mid- to high latitudes was stronger in 
the LGM because a relatively warm open ocean contrasted with the 
partly ice-covered land, and changing sea-ice cover affected both the 
meridional and zonal temperature gradients”°. Atmospheric tempera- 
ture gradients are a primary driver of local temperature variability on 
synoptic timescales. Accordingly, changes in spatial gradients due to 
mean climate changes have been proposed to control changes in vari- 
ability?”*, Hence, steeper temperature gradients in the LGM may have 
led to increased synoptic variability. Describing climate variability as 
the linear response to stochastic weather forcing integrated by the slow 
components of the climate system, such as the ocean”, increased synop- 
tic variability relates directly to an increase in variability on interannual 
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to millennial timescales”*. Contrasting the change in the atmospheric 
equator-to-pole temperature gradient—as estimated from a combined 
model-data temperature reconstruction’—with the estimated 
change in variability (Fig. 3b, Extended Data Fig. 2) indeed reveals 
a consistent pattern on a global scale (Spearman’s rank correlation 
coefficient r= 0.43, P=0.03, n = 28), although the high variability 
reconstructed for Greenland appears as an outlier (Fig. 3b). This rela- 
tionship between temperature gradient and variability change also 
holds for the heterogeneous pattern of temperature variability change 
over Antarctic land surfaces (Fig. 2d), although the quality of the gra- 
dient estimates on this regional scale is unclear. In addition, a reconfig- 
uration of the large-scale oceanic circulation could also drive changes 
in temperature variability. Perturbation experiments in climate models 
suggest that the Atlantic meridional overturning circulation (AMOC) 
may have been less stable in the LGM than in the Holocene”, and the 
temperature response to a varying AMOC that modulates the oceanic 
poleward heat flux shows a first-order pattern”® that is consistent with 
our estimated changes in variability (Fig. 3). However, there is no evi- 
dence that the imprint of AMOC modulations should be greater on 
Greenlandic air temperatures than on any other North Atlantic region. 

The general meridional pattern is thus consistent with both synoptic 
atmospheric and oceanic contributions to the change in variability. 
However, neither contribution can explain the considerably stronger 
variability change found in the oxygen isotope records from 
Greenlandic ice cores, which is 18 times stronger than the global 
mean—a polar-to-global change in variance that is much larger than 
the observed polar amplification during the twentieth century*. In 
addition, the resultant asymmetry between Greenlandic and Antarctic 
variability change contrasts with the rather symmetric polar amplifi- 
cation that is simulated by climate models for past and future climate 
states”®, The specific discrepancy for the Greenlandic records thus 
points either to a decoupling of Greenlandic temperature variability 
from global surface temperature variability, for example owing to the 
altitude of the ice sheet representing close to mid-tropospheric atmos- 
pheric conditions, or to strong influences on the isotopic composition 
of Greenlandic ice cores beyond the local site temperature. 

Sea-ice changes have been linked to temperature variability changes 
on interannual to decadal timescales’, and may also contribute to the 
uniqueness of the Greenlandic variability estimates. The sea-ice extent 
during the glacial period was larger than at present””, and the increased 
area favoured increased sea-ice variability on centennial timescales, 
a change that is corroborated by proxy-based sea-ice reconstructions 
(Extended Data Table 1). A large sea-ice lid shields more ocean heat 
from the atmosphere, reduces the effective heat capacity at the surface 
and thus also renders local temperatures more volatile under the same 
forcing. Additionally, a larger sea-ice area can change more, which 
amplifies temperature variability on the Greenland ice sheet through 
atmospheric feedbacks””. Changes in sea-ice extent also influence the 
seasonality of snow accumulation on the central Greenland ice sheet”®, 
which can strongly affect the isotopic composition recorded in ice 
cores”. Furthermore, changes in the moisture pathways as an atmo- 
spheric response to the large Northern Hemisphere ice sheets could 
also have caused changes in isotope variability that are unrelated to 
local temperatures*”. 

On the interannual to multidecadal scale, the surface temperature 
variability ratio in coupled model simulations from PMIP3 confirms 
the overall reduction in temperature variability from the LGM to the 
Holocene (Methods, Extended Data Fig. 3). The spatial pattern is 
similar, but the magnitude of change is smaller (R= 1.28; 90% confi- 
dence interval, 1.25-1.30), suggesting either a difference in the par- 
titioning of variability between fast and slow timescales, or that the 
models suppress long-term climate variability!” and so do not display 
realistic variability changes. The tendency of coupled climate models 
to underestimate changes in the meridional temperature gradient”® 
might also contribute to this discrepancy. To establish to what extent 
variability change is uniform across timescales, as predicted by linear 
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energy balance models“, or is specific to certain timescales related to 
dynamic modes in the climate system, variability estimates at decadal 
to centennial scales are needed. Possibilities include annually laminated 
sediment records or a better understanding of non-climate effects on 
ice-core records to enable reliable high-resolution reconstructions. The 
PMIP3 climate model results also suggest that the temperature vari- 
ability change in Greenland is not larger than elsewhere. Therefore, 
it is paramount to establish whether the Greenlandic variability 
change is indeed a change in local temperature variability or specific 
to the oxygen isotope proxy for temperature. The representativeness 
of Greenlandic isotope variability for Arctic and global temperature 
variability could be clarified using non-stable-water-isotope proxies 
for temperature in Greenland'®, more data from across the Arctic and 
climate modelling with embedded water-isotope tracers. 

Our results have implications for the understanding of past and 
future climate variability. The reconstruction reveals that temperature 
variability decreased globally by a factor of four for a warming of 3-8°C 
from the LGM to the Holocene. This decrease is small compared with 
the reduction by a factor of 73 estimated for Greenland and indicates 
that the change in variability recorded by Greenlandic ice cores is not 
representative of changes in variability across the globe. In terms of 
the magnitude of variability, these iconic datasets thus do not provide 
a reference for global climate changes as is often implicitly assumed. 
Consequently, we have to rethink the notion of an unstable glacial 
period and a very stable Holocene and the implications for societal 
evolution. Although a direct extrapolation from the glacial period to 
the future would not be prudent, it is reasonable to assume that the rela- 
tionship between mean change and variability change holds, given our 
mechanistic understanding of the drivers and the direction of future 
changes in the temperature gradient. Our findings therefore add sup- 
port to climate modelling studies that predict a reduction in winter 
temperature variability under global warming via reduced spatial 
gradients”. Our results further suggest that this variability (which 
dominates annual-mean temperature variability), might also translate 
to a reduction in multidecadal and slower variability’. More high- 
resolution records of glacial climate, continued quantification of 
recording and preserving processes of palaeoclimate signals, and an 
extension of similar analyses to other climate states will help to further 
constrain the mean-state dependence of climate variability. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Proxy data for variability estimates. For the variability analyses we collected all 
available proxy records for temperature that fulfilled the following sampling cri- 
teria. To be included, a record had to (1) be associated with an established, pub- 
lished calibration to temperature and (2) cover at least 4 kyr in the interval of the 
Holocene (8-0 kyr ago) and/or the LGM (27-19 kyr ago) at (3) a mean sampling 
frequency of 1/225 yr! or higher. Our definition for the LGM time slice, based 
on previously published starting*” and end" times, covers the coldest part of the 
last glacial with the most stable boundary conditions while maintaining the same 
period duration as for the Holocene section. All proxy time series which fulfil 
the sampling criteria for both time intervals are included in our primary ‘joint 
dataset. All time series which fulfil the criteria for only one of the two intervals 
are included only for this period (‘separate’ dataset). This dataset consequently 
also includes all records from the joint dataset. All selected records are listed in 
Supplementary Information along with the time intervals for which they were 
included. Extended Data Table 2 summarizes the individual variance ratio esti- 
mates for the joint dataset. 

Model-based estimates for the temperature gradient and variability change. 
Changes in temperature gradient between the LGM and the Holocene were 
estimated on the basis of an LGM-to-pre-industrial temperature anomaly” 
based on proxy and model data from the Paleoclimate Model Intercomparison 
Project Phase 2 (PMIP2). The equator-to-pole temperature gradient change was 
calculated from the temperature anomaly differences between adjacent grid- 
boxes in the poleward direction (north relative to south), divided by the merid- 
ional gridbox extent (222 km) and normalized to 1,000km. The model-based 
LGM-to-Holocene variability change estimate was derived from surface (2 m) air 
temperature output for the LGM and pre-industrial simulations available through 
PMIP Phase 3 (PMIP3-CMIP5) archives. Model simulations were included from 
the CCSM4, CNRM-CM5, FGOALS-g2, GISS-E2-R, IPSL-CM5A-LR, MIROC- 
ESM, MPI-ESM-P and MRI-CGCM3 models. For each model, the last 100 years 
of the archived simulations were used to estimate temperature variance fields. The 
fields of the ratio of variances were then re-gridded to a common T63 resolution 
to form the model-mean ratio of variances (Extended Data Fig. 3). We use the 
pre-industrial model results as a reasonable surrogate for the Holocene time slice 
because we are interested in the first-order patterns of the gradient and variability 
changes, which are governed by the deglaciation. 

Temperature recalibration of proxy records. Marine and ice-core records were 
recalibrated using a single temperature relationship for each proxy type and region 
to minimize the calibration-dependent uncertainty for variability estimates based 
on the separate dataset. Terrestrial records based on lacustrine sediments, pollen 
and tree data were not recalibrated owing to the lack of a suitable global calibration 
for these proxy types. 

Recalibration of ice-core records. For the calibration of ice-core stable isotope 
data to temperature (isotope-to-temperature slope in°C per %o) two distinct meth- 
ods exist: based on either the relationship of observed present-day spatial gradients 
in surface snow isotopic composition and temperature (spatial slope) or temporal 
gradients observed at a single site (temporal slope). 

For Greenland, temporal slopes appear to lie consistently above the spatial slope, 
depending on the timescale, probably owing to changes in moisture origin and 
seasonality of precipitation!®. For the Holocene temporal slope we used a borehole 
temperature calibration®* of 2.1°C per %o, with an estimated uncertainty of -+0.4°C 
per %o based on the slopes reported by other studies*!~*°. The LGM temporal slope 
is a factor of 1-2 greater than the Holocene slope*”**“”"’; as a best guess we used 
a factor of 1.5. 

For Antarctica, direct estimations of temporal slopes are difficult. However, the 
difference between spatial and temporal slopes as well as the timescale dependence 
of the latter is expected to be small**. Here, we adopted reported spatial slopes! of 
1.25°C per %o for §'8O and 0.16 °C per %o for 6°H, with an uncertainty of 20% for 
recalibrating the Antarctic ice-core data. 

For tropical ice cores, we adopted a constant calibration slope for 8!8O of 1.49°C 
per %o". 

Recalibration of marine records. Marine proxy records were recalibrated if the 
proxy type occurred more than once in our data collection and a suitable global 
calibration existed. Most of the Mg/Ca records in our compilation are based on 
planktic foraminifera Globigerinoides ruber, converted to temperatures using the 
calibration*® Mg/Ca = b x exp(a x SST), with a= 0.09 mmol mol” !°C™}, 
b=0.38 mmol mol}, and standard errors s,= 0.003 mmol mol~!°C~! and 
sp=0.02 mmol mol"!. For consistency, we recalibrated other G. ruber Mg/Ca 
records to the same calibration even though it is established using sediment trap 
samples and hence not a global calibration. For species other than G. ruber, that 
is, Globigerina bulloides (two records from different regions) and Neogloboquadrina 
pachyderma (left-coiling; one record), we kept the Mg/Ca records as published. 
Similarly, temperature records based on the transfer function of diatom, 
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radiolarian and foraminifera assemblages were kept as published. All U®'37-based 
records were recalibrated using the calibration!” US'37= a x SST + b, with 
a=0.033°C~!, b= 0.044, s;=0.001°C~! and s,=0.016. All TEXg and TEX, 
records were recalibrated to the subsurface TEX{}, calibration** T = a x TEX{,+ 8 
with a= 40.8 °C, b = 22.3 °C, s,= 4.37 °C and s, = 2.19 °C, because marine surface 
and subsurface temperature variability are on average similar**. 
Timescale-dependent variance and variance ratio estimation. The records were 
interpolated onto a regular time axis given by their individual mean sampling fre- 
quency in the LGM or the Holocene, following a previously reported procedure!’. 
To minimize aliasing, data were first linearly interpolated to ten times the target 
resolution, low-pass filtered using a finite response filter with a cut-off frequency 
of 1.2 divided by the target time step, and then resampled at the target resolution. 
Linear interpolation of a process that has been unevenly sampled reduces the vari- 
ance near the Nyquist frequency, but the sampling rate of our records relative to the 
timescale of the variance estimates is high enough to minimize this effect (Extended 
Data Fig. 4). Timescale-dependent variance estimates were obtained by integrating 
the raw periodogram” in the frequency band (fj, fs) using fj = 1/500 yr! 
and f; = 1/1,750 yr! to capture multicentennial- to millennial-scale temperature 
variability. All spectra are shown in Extended Data Fig. 4. Tests with surrogate 
records on the original time axes demonstrate that our estimates are largely 
unbiased (Extended Data Fig. 5). Furthermore, our results are robust to changes 
in the sampling criteria (Extended Data Fig. 1). 

Confidence intervals for the variance estimates were derived from the 7 dis- 
tribution with d degrees of freedom, in which d is given by twice the number of 
spectral power estimates in the frequency band (f,, f:). Confidence intervals for 
variance ratios were derived accordingly from the F distribution with the degrees 
of freedom of the variance estimates. 

For the joint dataset, zonally averaged variance ratios were derived from the 
bias-corrected individual ratio estimates as 


in which Rj= Vigm,i/ Vnol, is the noise-corrected variance ratio of the ith record. For 
the separate dataset, zonally averaged variance ratios were derived from the ratio 
of the zonal-mean variances with subsequent noise correction. 

For both datasets, global-mean variance ratios were derived from the area- 
weighted zonal means. To obtain the ratio distributions (Fig. 2a), we sample 50,000 
times with replacement from the proxy estimates (joint, ratios; separate, variances). 
For each realization, we form the zonal-mean estimates of the variance change 
(for the joint dataset), or of the mean Holocene and LGM variance and then take 
the ratio (for the separate dataset). We then form the area-weighted global mean 
for the variance change. Confidence intervals for the global-mean estimate are 
derived as quantiles from the realizations. The ratio distribution for Greenland is 
estimated using the same method but considering only the three Greenlandic ice 
cores. Kernel density estimates are shown (Fig. 2a) using a Gaussian smoothing 
kernel with a bandwidth of 1/10 of the mean ratio, so 0.4 for the global mean and 
7 for Greenland. 

Noise correction. We determine the effect of noise on the estimated variance ratio 
R’ between two climate periods, 


Rez var(X)) 
“ var(Xo) 


(1) 


Here, X; and Xo are the proxy time series of the investigated (LGM) and reference 
(Holocene) climate periods, respectively. Each proxy time series contains noise. 
Assuming additive noise, and that the climate signal and noise are uncorrelated on 
each timescale covered, we can split the variances in equation (1) into contributions 
from the signal S and the noise e: 


var(S}) 
var(So)[1 -+- SNR7}] 


var(€1) 


R= var(S1) + var(e1) _ 
var(So)[1-+SNR7}] 


var(So) + var(éo) 


in which we have introduced the reference-period signal-to-noise variance ratio, 
SNR=var(So)/var(€o). Identifying the true climate variance ratio, R= var(S))/ 
var(So), and denoting the noise variance ratio by F-=var(¢ )/var(€o), we obtain 


,. SNR», E 
1+SNR ° 1+SNR 
Solving for R yields 
RaRLTSNR __E Q) 
SNR SNR 
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Because R cannot be negative, the parameters must always satisfy the condition 
F./(1 + SNR) < R’. For any R’ > F., the effect of noise dampens the true ratio 
(R> R’; Extended Data Fig. 6a). 

To correct for the effect of noise on the LGM-to-Holocene variance ratio, we 
apply equation (2) both to every individual variance ratio estimated for the joint 
dataset and to the zonal-mean variance ratios derived from the separate dataset. A 
reasonable assumption is that the noise level is independent of the climate period 
(F-= 1), which we adopt for all analyses. For the joint dataset, we assume a signal- 
to-noise ratio of 1.5 for the Greenland records and of 1 for all other records. To 
correct the zonal-mean variance ratios derived from the separate dataset, we adopt 
a signal-to-noise ratio of 1. 

Testing the effect of the noise correction on the variability change difference. 
The signal-to-noise ratio is a considerable source of uncertainty for the noise cor- 
rection. Signal-to-noise ratios can be estimated, among other approaches, by direct 
forward modelling of the proxy’” or by correlation of nearby records!”°""*?. An 
overview of signal-to-noise ratios for the regions and proxies of interest is given in 
Extended Data Fig. 6c. We tested the effect of the noise correction on the difference 
between the Greenland ice-core-based variance ratio estimates and those from 
the proxy records outside Greenland. To bring the variance ratios into agreement, 
the signal-to-noise ratio of proxies outside Greenland would have to be less than 
0.05 (Extended Data Fig. 6b), which is one order of magnitude below published 
estimates for marine proxy’” and Antarctic isotope records. It is therefore unlikely 
that the observed variability difference can be attributed to Greenland ice cores 
being better recorders (that is, having a higher signal-to-noise ratio) than marine 
sediment or Antarctic ice-core records. 

Potential effect of ecological adaption and bioturbational mixing on marine 
variance ratios. Variability derived from biological proxies, that is, recorded by 
marine organisms, are possibly muted relative to the actual environmental changes 
owing to the tendency of organisms to adapt and seek their ecological niche (for 
example, of a certain temperature or nutrient range)**. Our results are based on 
the ratio of variability and not on absolute variability estimates. Therefore, for 
ecological adaptation to affect our results, LGM variability needs to be muted to a 
much larger extent than that for the Holocene. In the simple conceptual ecological 
model®, given the same temperature preference, larger variability would result in 
a stronger damping. However, the largest part of the variability seen by marine 
organisms is the seasonal and vertical temperature range in the depth habitat. 
This spread is controlled by insolation and stratification and not primarily by the 
climate state. The interannual to millennial variability, which we find to be larger 
in the LGM, contributes only a small fraction to the total variability and so should 
not be a primary control of the damping strength affecting the proxy records. 
Our oceanic temperature variability estimates for the joint dataset (containing 
both Holocene and LGM) are based on alkenone-based U*’37 (nine sites) and 
the Mg/Ca of planktic foraminifera G. ruber (six sites), the latter from tropical 
sites. Unlike planktic foraminifera, which have a preferred temperature niche, the 
known major producers of alkenones such as the coccolithophore Emiliana huxleyi 
occur throughout the global ocean from the tropics to the polar waters. Their 
abundance is mostly controlled by nutrient and light availability, which do not 
always co-vary with temperature. Most of our G. ruber Mg/Ca records are from 
the tropics, with Holocene temperatures (such as 29°C at SO189-39KL; Fig. 1c) 
close to the warm end of their temperature niche (15-29 °C; ref. 54), whereas LGM 
temperatures (such as 26°C at SO189-39KL; Fig. 1c) are closer to the mean of the 
range. Therefore, if there is ecological adaptation, it is more likely to occur near 
the extremes (that is, the Holocene) rather than in the middle of the range. This 
would result in an amplified variance ratio between the Holocene and the LGM. 

Bioturbational mixing in marine sediments reduces the absolute variability pre- 
served in marine sediments. However, here we focus our analysis on variability 
changes and thus largely circumvent this problem, because both the glacial and the 
Holocene part of the core are affected by bioturbation. Bioturbation can be approx- 
imated as a linear filter®* and so the ratio of variances is not affected as long as the 
sedimentation rate and bioturbation strength that define the filter are similar in 
both time periods or do not change systematically between climate states. Our data- 
set shows no evidence of a systematic change in sedimentation rate with 7 of the 16 
marine cores in our joint dataset showing higher and 9 lower sedimentation rates 
in the Holocene (with a statistically insignificant change in mean sedimentation 
rate of 20%). The changes also show no detectable latitudinal dependency. There 
is also no evidence of a systematic change in largely unconstrained bioturbation 
strength between the time periods in the manuscripts that describe the datasets. 

Despite not being effects of climate, the ecological preference of the organ- 
isms that record the climate signal and bioturbational mixing of the sediment can 
affect variability estimates and may thus add to site-specific variability changes. 
But the aforementioned arguments demonstrate that their effect is expected to be 
very small compared to the orders of magnitude difference between tropics and 
mid-latitudes and between marine cores and ice cores. 


Testing the effect of the proxy sampling locations on zonal-mean variance 
estimates. The proxy locations are not randomly distributed in space, which could 
lead to sampling biases. To test for a potential sampling bias we analyse the 2-metre 
temperature field of the past 7,000 years from the coupled atmosphere ocean 
TraCE-21K simulation®®. The time period is chosen to focus on the continuum of 
climate variability and to minimize the effect of the deglaciation. The temperature 
variance field for centennial and longer timescales is derived by estimating the 


variance at every grid point after applying a low-pass finite response filter with a 


cut-off frequency of 1/100 yr}. 


We sample the variance field at the actual proxy locations and average the results 
into the same latitude bands as for the proxy-based variance ratio estimates. To 
estimate the expected distribution of mean values from unbiased locations, we 
sample N random locations at each latitude band, where N corresponds to the 
number of actual records in each band. We form the mean of this random sample 
and repeat the procedure 10,000 times, from which we report the 90% quantiles. 
The results (Extended Data Fig. 7) show that the mean values from the actual proxy 
locations are always inside the expected distribution. This result holds when using 
the full dataset and when restricting the analysis to the records that cover both the 
LGM and the Holocene. 

Code availability. Code is available on request from the authors. 

Data availability. All data that support the findings of this study are available 
within the paper, given references or Supplementary Information. Source Data 
for Figs 2 and 3 are provided with the paper. 


32. Rasmussen, S. O. et al. Synchronization of the NGRIP, GRIP, and GISP2 ice 
cores across MIS 2 and palaeoclimatic implications. Quat. Sci. Rev. 27, 18-28 
(2008). 

33. Vinther, B. M. et al. Holocene thinning of the Greenland ice sheet. Nature 461, 
385-388 (2009). 

34. Shuman, C. A. et a/. Temperature and accumulation at the Greenland Summit: 
comparison of high-resolution isotope profiles and satellite passive microwave 
brightness temperature trends. J. Geophys. Res. 100, 9165-9177 (1995). 

35. Beltrami, H. & Taylor, A. E. Records of climatic change in the Canadian Arctic: 
towards calibrating oxygen isotope data with geothermal data. Global Planet. 
Change 11, 127-138 (1995). 

36. Cuffey, K. M., Alley, R. B., Grootes, P. M., Bolzan, J. M. & Anandakrishnan, S. 
Calibration of the 6!80 isotopic paleothermometer for central Greenland, using 
borehole temperatures. J. Glaciol. 40, 341-349 (1994). 

37. Cuffey, K. M. & Clow, G. D. Temperature, accumulation, and ice sheet elevation 
in central Greenland through the last deglacial transition. J. Geophys. Res. 102, 
26383-26396 (1997). 

38. Johnsen, S. J. et a/. The 6!80 record along the Greenland Ice Core Project deep 
ice core and the problem of possible Eemian climatic instability. J. Geophys. 
Res. 102, 26397-26410 (1997). 

39. Sjolte, J. et al. Modeling the water isotopes in Greenland precipitation 
1959-2001 with the meso-scale model REMO-iso. J. Geophys. Res. 116, 

D18105 (2011). 

40. Johnsen, S. J., Dahl-Jensen, D., Dansgaard, W. & Gundestrup, N. Greenland 

palaeotemperatures derived from GRIP bore hole temperature and ice core 

isotope profiles. Tellus B 47, 624-629 (1995). 

41. Cuffey, K. M. et al. Large arctic temperature change at the Wisconsin-Holocene 

glacial transition. Science 270, 455-458 (1995). 

42. Kindler, P. et a/. Temperature reconstruction from 10 to 120 kyr b2k from the 

GRIP ice core. Clim. Past 10, 887-902 (2014). 

43. Jouzel, J. Magnitude of isotope/temperature scaling for interpretation of 

central Antarctic ice cores. J. Geophys. Res. 108, 4361 (2003). 

44. Masson-Delmotte, V. et a/. A review of Antarctic surface snow isotopic 

composition: observations, atmospheric circulation, and isotopic modeling. 

J. Clim. 21, 3359-3387 (2008). 

45. Thompson, L. G. et a/. Late glacial stage and Holocene tropical ice core records 

rom Huascaran, Peru. Science 269, 46-50 (1995). 

46. Anand, P, Elderfield, H. & Conte, M. H. Calibration of Mg/Ca thermometry in 

planktonic foraminifera from a sediment trap time series. Paleoceanography 

18, 1050 (2003). 

47. Miller, P. J., Kirst, G., Ruhland, G., von Storch, |. & Rosell-Melé, A. Calibration of 

he alkenone paleotemperature index U37“’ based on core-tops from the 
eastern South Atlantic and the global ocean (60° N-60° S). Geochim. 
Cosmochim. Acta 62, 1757-1772 (1998). 

48. Ho, S. L. & Laepple, T. Flat meridional temperature gradient in the early 
Eocene in the subsurface rather than surface ocean. Nat. Geosci. 9, 606-610 
(2016). 

49. Chatfield, C. The Analysis of Time Series: An Introduction 6th edn (Chapman & 
Hall/CRC, 2004). 

50. Fisher, D.A., Reeh, N. & Clausen, H. B. Stratigraphic noise in time series derived 
from ice cores. Ann. Glaciol. 7, 76-83 (1985). 

51. Steen-Larsen, H. C. et a/. Understanding the climatic signal in the water stable 
isotope records from the NEEM shallow firn/ice cores in northwest Greenland. 
J. Geophys. Res. 116, D06108 (2011). 

52. Munch, T., Kipfstuhl, S., Freitag, J. Meyer, H. & Laepple, T. Constraints on 
post-depositional isotope modifications in East Antarctic firn from 
analysing temporal changes of isotope profiles. Cryosphere 11, 2175-2188 
(2017). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


53. 
54. 


55. 


Mix, A. in North America and Adjacent Oceans during the Last Deglaciation (eds 


Ruddiman, W. F. & Wright, H. E.) 111-135 (Geological Society of America, 1987). 


Hilbrecht, H. Extant Planktic Foraminifera and the Physical Environment in the 
Atlantic and Indian Oceans: An atlas based on CLIMAP and Levitus (1982) data. 
Technical Report (Eidgendssische Technische Hochschule und Universitat 
Zurich, 1996). 

Berger, W. H. & Heath, G. R. Vertical mixing in pelagic sediments. J. Mar. Res. 
26, 134-143 (1968). 


56. 
57. 


58. 


LETTER 


Liu, Z. et al. Transient simulation of last deglaciation with a new mechanism for 
Balling-Allerad warming. Science 325, 310-314 (2009). 

Miller, J. & Stein, R. High-resolution record of late glacial and deglacial sea ice 
changes in Fram Strait corroborates ice-ocean interactions during abrupt 
climate shifts. Earth Planet. Sci. Lett. 403, 446-455 (2014). 

Hoff, U., Rasmussen, T. L., Stein, R., Ezat, M. M. & Fahl, K. Sea ice and millennial- 
scale climate variability in the Nordic seas 90 kyr ago to present. Nat. Commun. 
7, 12247 (2016). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


100 + a timescale 500-1000 yr 


b timescale 1000-1750 yr 


_ 20 200 | 
2 2 
> > 
~~ —~ 
5 5 
a a 7 % 
5 53 3 
fo} lo} 
° ° 
rom roe 
n n 
2 2 
a | 4 
(-90,50]  (-50,-20]  (-20,20] ~—- (20,50) (50,90] (-90,50] — (-50,-20]  (-20,20] ~—- (20,50) (50,90] 
Latitude (°N) Latitude (°N) 
250 250 
Cc timescale 650-2000 yr d timescale 500-1750 yr 
100 100 
2 2 
“ 20 5 20 
> > 
% % 
® o 
5 8 5 8 
fai fom 
7) n 
2 2 
1 1 
(-90,50]  (-50,-20]  (-20,20] ~—- (20,50) (50,90) (-90,50]  (-50,-20]  (-20,20) ~—- 20,50] (50,90) 
Latitude (°N) Latitude (°N) 
Extended Data Figure 1 | Zonal variability change pattern for different 650-2,000-year timescale with a minimum of 20 data points (c); and 


timescales and length requirements. a—d, Results for the estimated zonal- — 500-1,750-year timescale with a minimum of 25 data points (d), which 
mean variance ratios based on the joint dataset are shown as a function of corresponds to the results shown in the main text. The number of records 
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proxy compilations (red bars denote the joint estimate, orange points the interval of the mean; error bars in c show the 90% confidence interval of 
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change based on the PMIP3-CMIP5 simulations for the LGM and the 


pre-industrial period. c. Individual variability change at the proxy 
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Extended Data Figure 4 | Raw periodograms of all records. Thin blue 
lines show the spectra of the Holocene time slice; thin green lines show the 
spectra of the LGM time slice. Logarithmically smoothed spectra are given 
as thick lines with 90% confidence intervals as shading. Grey areas indicate 
the frequency response outside the bandwidth used for the timescale- 
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dependent variance ratio estimate. x-axis scaling is in periods in years; 
y-axis scaling denotes power spectral density. Text insets give the 
time-slice variances for the LGM and the Holocene (‘Hol’) in K’; variance 
ratios for the records from the joint dataset are listed in Extended Data 
Table 2. 
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Extended Data Figure 5 | Surrogate tests for the magnitude of variance 
change. The magnitude of potential biases in the variance ratio estimates 
were derived using 1,000 realizations of power-law noise (slope 3= 1) 

of constant variance on the original time axes of the records. Analyses 

for variability quantification were performed as for the primary analyses 
and described in Methods. a, Histogram of the bias of the variance ratio 
estimated from the surrogate data. The mean of the distribution (red line) 
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is not significantly different from zero (c.i., confidence interval). 

b, Estimated zonal-mean ratios from the surrogate data. The individual 
surrogate zonal-mean ratios (black bars) are all close to 1 and show no 
latitudinal pattern, in contrast to the zonal-mean ratios from the proxy 
data (joint dataset, green bars). Error bars show the 90% confidence 
interval for the proxy data and +2 times the standard error of the zonal 
mean for the surrogate data (n = 1,000). 
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East Antarctica (EDML) 1.0 (0.5) correlation seasonal [52] 


Extended Data Figure 6 | Effect of the Holocene signal-to-noise ratio of 
proxy records on the noise correction of the variance ratios estimated. 
a, Noise correction as a function of the Holocene signal-to-noise ratio 
(SNR). The ratio of the true variance ratio to the estimated one, R/R’, is 
displayed for R’ =0.5 and R’=5 (dashed lines) for a noise variance ratio of 
F.=1. The shaded area denotes the region where for R’=0.5, no R/R’ >0 
exists. b, Test for the comparability of marine and Greenland ice-core 
variance ratios as a function of the signal-to-noise ratio. The expected 
value of R for the mean over all records of the joint dataset below 70° N 

is shown under the assumption of a wide range of signal-to-noise ratios 


(solid blue line), with uncertainty (dashed lines) of +2s.e.m. (n= 25). 
Within the realistic range of Holocene signal-to-noise ratios (shaded blue 
area, based on the published estimates listed in c), the noise-corrected 
global variance ratio (excluding Greenland) ranges from 1.7 to 11.4, 

which cannot be brought into agreement with the mean variance ratio 

of the Greenland ice cores (horizontal green line; shading denotes full 
uncertainty including the range of Greenland signal-to-noise ratios (c) 
used in the noise correction). c, Overview of published!”*°" proxy signal- 
to-noise ratio estimates for the Holocene. Greenlandic and Antarctic 
estimates refer to §!8O. CI, confidence interval. 
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Extended Data Figure 7 | Representativeness of the proxy data 
locations. The centennial temperature variability in the TraCE-21K 
simulation, sampled at the proxy locations (black circles), the zonal-mean 
variability (green line) and the mean of the variability in the zonal box, 
either formed only from the variance at the proxy sites (blue) or formed 
using all grid points (red), are shown. The red vertical lines show the 90% 


quantiles from the mean of N random samples of the variance field, where 
Nis the number of proxy sites in the zonal box. a, Results when sampling 
from the proxy locations of the separate dataset. b, Results when sampling 
from the joint dataset. In all cases the mean of the proxy sites is inside 

the distribution of random samples, which demonstrates that under the 
assumption of this variance field the proxy estimates are unbiased. 
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Extended Data Table 1 | North Atlantic sea-ice variability ratios 


Site Lat. (°N) Lon. (°E) Proxy R (90% c.i.) Ref. 
MSM5-5-712-2 78.9 6.8 PBIP25 2.1 (1.1-4.1) [57] 
MSM5-5-712-2 78.9 6.8 IP25 11.2 (5.6-22.1) [57] 
JM11-Fl-19PC 62.8 —3.9 PBIP25 14.1 (5.8-35.8) [58] 


The variance ratios R are listed, based on sea-ice reconstructions®”°8 from three North Atlantic records (two sites, one based on two different sea ice proxies). 
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Extended Data Table 2 | Individual variability ratio estimates for all records from the joint dataset 


No. Record Rest Realib (lower) Realib (upper) Rest (5% quantile) Rest (95 % quantile) Rraw 
3 NGRIP 88.6 28.8 103.8 44.7 175.8 53.6 
4 GISP2 35.5 11.7 42 17.9 70.4 21.7 
5 GRIP 113.4 36.8 132.6 57.1 224.9 68.4 
30 MD01-2412 6.1 3:2 4 3:2 11.8 3.6 
34 PC-6 O:7 0.8 1 0.4 1.4 0.9 
35 MD03-2699 2 1:3 L7 0.9 4.4 LS 
38 MD01-2444 11.6 5.6 EL 5.8 22.9 6.3 
42 ODP976-4 43.5 21.9 86.2 22.2 
47 MD98-2195 15.8 74 9.5 6.8 33:1 8.4 
52 ODP658C-Zhao 1.8 132 1.6 0.9 3.6 1.4 
60 SK237-GC04 13 1 1.3 0.6 2.5 11 
62 ODP1240 4.1 2 8.8 2.6 
63 PLO7-39PC 1:3 a 13 0.6 2.7 1,2 
67 MD98-2181 4.7 2.5 3:3 2.1 9.9 2.9 
68 MD03-2707 0.8 0.8 1 0.4 1.6 0.9 
69 $0189-39KL 1 0.9 1.2 0.5 2 1 

81 MD01-2378 3 1.8 2.3 13 6.6 2 

84 MD97-2121 3:7 21 27 Le 8.7 2.4 
88 MD97-2120 8.1 4 5.1 3.9 17.4 4.5 
90 Talos Dome id 0.5 2.4 0.6 2.2 1 

91 Dome G 19 0.6 3.2 0.9 3.7 14 
92 EPICA DML 3:3 0.9 4.8 1.6 6.5 2.1 
93 EPICA Dome C 0.3 0.3 15 0.2 0.6 0.7 
94 Dome F 0.9 0.4 21 0.4 18 0.9 
96 Vostok 0.4 0.3 1.5 0.2 0.7 0.7 
97 WAIS Divide 5:3 1.4 7 2k 10.4 31 
98 Byrd 1.8 0.6 3.1 0.9 3:5 1.4 
99 Siple Dome 9.6 2.4 11.9 48 19 5:3 


The estimate used throughout this paper is the noise-corrected variance ratio Res (first data column). Reali (lower/upper) denotes the results for the variance ratios when using the calibration 
parameters with the lower/upper limits of the calibration uncertainty for the LGM and the upper/lower calibration uncertainty limits for the Holocene. Data columns four and five give the 5% and 95% 
quantiles of the estimate used (Rest). Data column six gives the raw, uncorrected ratio (Rraw). Numbers (first column) refer to the list of records given in Supplementary Information. For ODP976-4 and 
ODP 1240, no calibration uncertainty estimate is available. 
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Fluctuating interaction network and time-varying 
stability of a natural fish community 


Masayuki Ushio!?34, Chih-hao Hsieh>°’, Reiji Masuda®, Ethan R Deyle’, Hao Ye*!°, Chun- Wei Chang®, George Sugihara’ & 


Michio Kondoh! 


Ecological theory suggests that large-scale patterns such as 
community stability can be influenced by changes in interspecific 
interactions that arise from the behavioural and/or physiological 
responses of individual species varying over time!3. Although 
this theory has experimental support”*°, evidence from natural 
ecosystems is lacking owing to the challenges of tracking rapid 
changes in interspecific interactions (known to occur on timescales 
much shorter than a generation time)® and then identifying the 
effect of such changes on large-scale community dynamics. Here, 
using tools for analysing nonlinear time series®° and a 12-year-long 
dataset of fortnightly collected observations on a natural marine fish 
community in Maizuru Bay, Japan, we show that short-term changes 
in interaction networks influence overall community dynamics. 
Among the 15 dominant species, we identify 14 interspecific 
interactions to construct a dynamic interaction network. We show 
that the strengths, and even types, of interactions change with 
time; we also develop a time-varying stability measure based on 
local Lyapunov stability for attractor dynamics in non-equilibrium 
nonlinear systems. We use this dynamic stability measure to 
examine the link between the time-varying interaction network 
and community stability. We find seasonal patterns in dynamic 
stability for this fish community that broadly support expectations 
of current ecological theory. Specifically, the dominance of weak 
interactions and higher species diversity during summer months 
are associated with higher dynamic stability and smaller population 
fluctuations. We suggest that interspecific interactions, community 
network structure and community stability are dynamic properties, 
and that linking fluctuating interaction networks to community- 
level dynamic properties is key to understanding the maintenance 
of ecological communities in nature. 

The dynamics of ecological communities are influenced by inter- 
specific interactions occurring at multiple temporal and spatial scales. 
Earlier studies have focused mainly on long-term effects; specifically 
those that focus on the timescale of the birth—-death process (for 
example, predator-prey interactions)!”-'3 or those that relate com- 
munity stability to gross properties of the interaction network such 
as mean interaction strength, preponderance of weak interactions 
and species diversity'®'*!°. However, more recent theoretical and 
experimental studies have revealed that temporally variable ecological 
and/or biological responses (including physiological and behavioural 
responses) can have considerable effects on community dynamics). 
In other words, short-term responses such as adaptive resource choice, 
inter-habitat movement or physiological metabolic responses can in 
principle generate rapid changes in interaction strength, influence 
population dynamics and even reverse the classic relationship between 
community complexity and stability’. 


Although the arguments are compelling, evidence is lacking for 
whether and how short-term fluctuations in interspecific interactions 
influence the overall stability of ecological communities in nature. 
There are two main challenges here: (1) quantifying fluctuating 
interspecific interactions and (2) evaluating fluctuating community 
stability. First, there is the practical challenge of measuring rapidly 
changing multiple interactions as they occur in nature. Traditional 
approaches such as direct observation and experimental manipulations 
(for example, species exclusions) have provided insights into species 
interactions and their consequences for community dynamics*!*"”. 
For example, manipulative experiments have shown that the interac- 
tions of species are often variable and that this variability can strongly 
influence the dynamics of a local community*!”. However, as has 
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Figure 1 | Reconstructed interaction network of a subset of the Maizuru 
Bay fish community. Arrows indicating interspecific interactions are 
assigned on the basis of the results of convergent cross mapping (Extended 
Data Table 1). Blue and red colours indicate positive and negative 
interactions, respectively, calculated by the S-map method based on the 
12-year average. All fish images by R.M. 
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Figure 2 | Time-varying interspecific interactions in a subset of the 
Maizuru fish community. a, Fourteen interspecific interactions quantified 
by the S-map method. The x axis indicates the sampling time (2-week 


previously been shown’, these approaches are labour-intensive and 
are not feasible for studying large ecological communities in nature. 
Second, because interspecific interactions vary over time, the resultant 
community stability also varies*; this means that evaluating community 
stability is not at all straightforward. Natural ecosystems do not 
typically exhibit equilibrium dynamics’~*'* that would accommodate 
a standard calculation of stability. Thus, for non-equilibrium systems 
that possess intrinsic variability (that is, systems that exhibit nonlinear 
dynamics) the magnitude of population fluctuations (for example, 
coefficient of variation of abundance) may not be a good indicator of 
community stability or resilience, because there exists the potential for 
confounding effects. Here we look instead at a measure that accounts 
for nonlinear dynamics and that tracks community stability as it varies 
through time. Relating fluctuating interaction networks to community 
stability is crucial for understanding how natural ecological commu- 
nities are maintained. 

Fluctuating ecological interaction networks can be identified 
and measured with empirical dynamic modelling® *!*!°—tools 
based on attractor reconstruction that are specifically designed for 
analysing nonlinear dynamical systems from their time series®?!*!? 
(see Extended Data Fig. 1, Methods and Supplementary Information 
section 1). 

We begin by applying convergent cross mapping’, an empirical 
dynamic modelling causality test, to identify the linkages defining the 
interaction network for the fish community in Maizuru Bay, a 12-year- 
long monitoring study that collected observations once every two 
weeks”? (Extended Data Fig. 2). Overall, we identify 14 interspecific 
interactions among the 15 dominant fish species (Fig. 1, Extended Data 
Fig. 3 and Extended Data Table 1). Most of the detected interactions 
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are ecologically interpretable (Supplementary Information section 
2), and all the species—except Engraulis japonicus—have at least one 
interaction, which indicates that interspecific interactions have a 
non-trivial role in the community dynamics. 

The attractor for a set of causally related fish species is constructed by 
plotting their abundances as a point in a coordinate space in which the 
axes are the set of causally related species (see Methods), and tracing 
the position forward in time to delineate a trajectory’ (https://youtu. 
be/fevurdpiRYg). As the system travels along its attractor, S-maps 
(sequential locally weighted global linear maps) can be used to com- 
pute sequential Jacobian matrices”, the elements of which are partial 
derivatives that describe the changing interactions between species’; 
this is known as the multivariate S-map method®*"® (see Methods). 

Figure 2a shows that interactions in the Maizuru Bay fish commu- 
nity are not static; this contradicts a common assumption of ecological 
research. Instead, they change over time, as expected for a system with 
nonlinear dynamics (Extended Data Table 2). Of the 14 interspecific 
interactions, on average 8 are negative and 6 are positive. The right- 
skewed distribution of mean interaction strengths in Fig. 2b shows that 
the interaction network is dominated by weak links; this domination is 
hypothesized to be a stabilizing property'*’>. There is also a clear sea- 
sonal pattern at the community level; weak interactions become more 
dominant during summer months than during winter months (Fig. 3), 
with a lower median:maximum interaction strength ratio (as this index 
decreases, weak interactions become more dominant). These fluctu- 
ations in interaction strengths could be driven by a number of mech- 
anisms acting independently or together; these include time-varying 
behavioural and/or physiological responses'~°, fluctuations in species 
diversity”’, or a weakening of interactions among fish species due to 
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Figure 3 | Time-varying stability, interaction strength and Simpson’s 
diversity index. a, The dynamic stability of the fish community. 
Dynamic stability is computed as the absolute value of the real part of the 
dominant eigenvalue of the interaction matrix at each time point. The fish 
community tends to recover from perturbations if the dynamic stability 
is lower than 1 (dashed line). b-d, Mean interaction strength (b), weak 
interaction index (median interaction strength:maximum interaction 
strength ratio; a lower ratio indicates increasing dominance of weak 
interactions) (c) and Simpson’s diversity index (d) of the fish community. 
White or grey shading delineate each 1-year interval (from January to 
December). 


higher primary productivity during the summer that results in higher 
fish abundance”. 

Community stability at each time point is evaluated by computing 
the dominant eigenvalue of the time-varying interaction matrix: this 
sequentially computed ‘local Lyapunov stability’ is hereafter referred to 
as ‘dynamic stability’ (see Methods). Our analysis reveals that dynamic 
stability varies in a non-random way (Extended Data Table 2); commu- 
nity dynamics are mainly stable in the summer (that is, the dynamic 
stability is less than 1.0; Fig. 3a), and unstable in the winter (that is, the 
dynamic stability is more than 1.0; Fig. 3a). Sensitivity analyses show 
that dynamic stability is robust when including less abundant species 
in the analysis, as well as when incorporating observation errors in the 
census data (Extended Data Fig. 4 and Supplementary Information 
sections 3, 4). Finally, we find that the stable time period (dynamic 
stability < 1.0) contains smaller variations in population abundances 
than the unstable period (Extended Data Fig. 5 and Supplementary 
Information section 5), and this supports the proposition that 
population fluctuations reflect community stability in the Maizuru Bay 
fish community, given that the level of stochastic noise through time 
is relatively constant. 

We identify two large-scale properties as being responsible for 
the fluctuation in dynamic stability (Fig. 4a—-c; see also Extended 
Data Figs 6, 7): overall interspecific interaction strength and species 
diversity. Figure 4d shows that the dominance of weak interactions 
is the strongest driver of dynamic stability, with the largest absolute 
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Figure 4 | Causal influences between dynamic stability, interaction 
strength and Simpson’s diversity index. a—c, Convergent cross mapping 
between dynamic stability and properties of the interspecific interaction 
network (a, b), and Simpson's diversity index (c). Solid lines indicate cross- 
map skill (g) from dynamic stability to another variable, which represents 
the causal influence of that variable on dynamic stability. Shaded regions 
indicate 95% confidence intervals of 100 surrogate time series. Cross- 

map skills (9) reported in a-c were all significant. d, Influences of mean 
interaction strength, weak interaction index and diversity on dynamic 
stability (n= 240 calculated partial derivatives for each index). Midline, 
box limits, whiskers and points indicate median, upper and lower 
quartiles, 1.5x interquartile range and outliers, respectively. Note that 
smaller values of dynamic stability are indicative of stability (see Fig. 3 and 
Methods). 


effect. Therefore, the co-occurrence of weaker interactions and stable 
conditions during summer (Fig. 3a, c) seems to reflect a true causal 
relationship. This provides empirical support for the theory that weak 
interactions are stabilizing'*”*. The analysis also identifies species 
diversity (Simpson’s diversity index) as a stabilizing factor (Fig. 4c), 
supporting recent findings from manipulative experiments”* and 
addressing the long-standing question of how species diversity influ- 
ences community dynamics’!!”34, Our results show that the Maizuru 
Bay fish community tends to recover faster from perturbations when 
species diversity is higher. In fact, higher species diversity seems to be a 
necessary condition for the dominance of weak interactions (Extended 
Data Fig. 8) and more stable communities (Fig. 4d). Because diversity 
seems to be a weaker driver of stability than interaction strength, it is 
likely that the latter is a more proximate driver of dynamic stability. 
Further investigations drawing on additional observational time series 
could reveal whether and how different interspecific interactions— 
for example, diet choice, anti-predator defence and inter-habitat 
movement—are involved in the maintenance of the Maizuru Bay fish 
community. 

Here we present a framework based on attractor reconstruction from 
observational time series that quantifies the dynamic nature of the 
community interaction network and provides an estimate of dynamic 
stability. Although the exact individual-level behaviour that gives rise to 
the interspecific effect cannot be addressed by this analysis, the analysis 
does enable quantitative identification of the essential interactions that 
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influence community dynamics. Further applications of this framework 
to ecological time series in different geographical regions—for exam- 
ple, Arctic and tropical regions’—will enable tests of the generality 
of the present results, and aid in identifying other critical patterns in 
the dynamic stability of natural ecological communities. Such applica- 
tions of empirical dynamic modelling could also clarify the relation- 
ships between interaction strengths, properties of the distribution (for 
example, the dominance of weak interactions, skewness and standard 
deviations), network structure (for example, arrangements and topol- 
ogies) and community dynamics (such as the relationship between 
dynamic stability and population variation observed in this study), 
enabling a more in-depth investigation of the mechanisms by which 
dynamic interactions and species diversity govern the behaviour of a 
wide range of natural ecosystems. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 


Received 13 February 2017; accepted 9 January 2018. 
Published online 7 February 2018. 


1. Kondoh, M. Foraging adaptation and the relationship between food-web 
complexity and stability. Science 299, 1388-1391 (2003). 

2. Reynolds, P. L. & Bruno, J. F. Multiple predator species alter prey behavior, 
population growth, and a trophic cascade in a model estuarine food web. 
Ecol. Monogr. 83, 119-132 (2013). 

3. McMeans, B. C., McCann, K. S., Humphries, M., Rooney, N. & Fisk, A. T. Food 
web structure in temporally-forced ecosystems. Trends Ecol. Evol. 30, 662-672 
(2015). 

4. Gratton, C. & Denno, R. F. Seasonal shift from bottom-up to top-down impact 
in phytophagous insect populations. Oecologia 134, 487-495 (2003). 

5. Navarrete, S. A. & Berlow, E. L. Variable interaction strengths stabilize marine 
community pattern. Ecol. Lett. 9, 526-536 (2006). 

6. Deyle, E. R., May, R. M., Munch, S. B. & Sugihara, G. Tracking and forecasting 
ecosystem interactions in real time. Proc. R. Soc. Lond. B 283, 20152258 
(2016). 

7. Sugihara, G. et al. Detecting causality in complex ecosystems. Science 338, 
496-500 (2012). 

8. Sugihara, G. & May, R. M. Nonlinear forecasting as a way of distinguishing 

chaos from measurement error in time series. Nature 344, 734-741 (1990). 

9. Sugihara, G. Nonlinear forecasting for the classification of natural time series. 

Philos. Trans. R. Soc. A 348, 477-495 (1994). 

10. Allesina, S. et a/. Predicting the stability of large structured food webs. 

Nat. Commun. 6, 7842 (2015). 

11. May, R. M. Will a large complex system be stable? Nature 238, 413-414 

(1972). 

12. Tang, S., Pawar, S. & Allesina, S. Correlation between interaction strengths 

drives stability in large ecological networks. Ecol. Lett. 17, 1094-1100 

(2014). 

13. Mougi, A. & Kondoh, M. Diversity of interaction types and ecological 

community stability. Science 337, 349-351 (2012). 

14. McCann, K., Hastings, A. & Huxel, G. R. Weak trophic interactions and the 
balance of nature. Nature 395, 794-798 (1998). 


4 | NATURE | VOL 000 | 00 MONTH 2018 


15. Wootton, K. L. & Stouffer, D. B. Many weak interactions and few strong; 
food-web feasibility depends on the combination of the strength of 
species’ interactions and their correct arrangement. Theor. Ecol. 9, 185-195 
(2016). 

16. Wootton, J. T. & Emmerson, M. Measurement of interaction strength in nature. 
Annu. Rev. Ecol. Evol. Syst. 36, 419-444 (2005). 

17. Berlow, E. L. Strong effects of weak interactions in ecological communities. 
Nature 398, 330-334 (1999). 

18. Dixon, P.A., Milicich, M. J. & Sugihara, G. Episodic fluctuations in larval supply. 
Science 283, 1528-1530 (1999). 

19. Ye, H. et a/. Equation-free mechanistic ecosystem forecasting using empirical 
dynamic modeling. Proc. Nat! Acad. Sci. USA 112, E1569-E1576 (2015). 

20. Masuda, R. et al. Fish assemblages associated with three types of artificial 
reefs: density of assemblages and possible impacts on adjacent fish 
abundance. Fishery Bull. 108, 162-173 (2010). 

21. Allesina, S. & Tang, S. Stability criteria for complex ecosystems. Nature 483, 
205-208 (2012). 

22. Bascompte, J., Melian, C. J. & Sala, E. Interaction strength combinations and 
the overfishing of a marine food web. Proc. Nat! Acad. Sci. USA 102, 
5443-5447 (2005). 

23. Downing, A. L., Brown, B. L. & Leibold, M. A. Multiple diversity-stability 
mechanisms enhance population and community stability in aquatic food 
webs. Ecology 95, 173-184 (2014). 

24. Hector, A. et al. General stabilizing effects of plant diversity on grassland 
productivity through population asynchrony and overyielding. Ecology 91, 
2213-2220 (2010). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We thank members of the Kondoh laboratory in Ryukoku 
University; F. Grziwotz, A. Telschow and T. Miki for discussions; S.-l. Nakayama 
for advice on the twin surrogate method; and T. Yoshida and M. Kasada for 
providing time series of the algae—rotifer system. This research was supported 
by CREST, grant number JPMJCR13A2, Japan Science and Technology 
Agency; KAKENHI grant number 15K14610 and 16H04846, Japan Society 
for the Promotion of Science; Foundation for the Advancement of Outstanding 
Scholarship (Ministry of Science and Technology, Taiwan); DoD-Strategic 
Environmental Research and Development Program 15 RC-2509; Lenfest 
Ocean Program 00028335; NSF DBI-1667584; NSF DEB-1655203; the 
McQuown Fund and the McQuown Chair in Natural Sciences (University of 
California, San Diego). 


Author Contributions M.U., C.H. and M.K. designed the research programme; 
R.M. collected fish monitoring data; M.U. and G.S. conceived the idea of 
computing local Lyapunov stability from S-maps; M.U. performed analysis 
with help from C.H., E.R.D., H.Y. and C.-W.C.; M.U., C.H. and M.K. wrote the first 
draft of the paper; and all authors were involved in interpreting the results, and 
contributed to the final draft of the paper. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial 
interests. Readers are welcome to comment on the online version of the paper. 
Publisher's note: Springer Nature remains neutral with regard to jurisdictional 
claims in published maps and institutional affiliations. Correspondence and 
requests for materials should be addressed to M.U. (ong8181@gmail.com) or 
M.K. (kondohmjp@icloud.com). 


Reviewer Information Nature thanks J. Bascompte, U. Brose and K. McCann for 
their contribution to the peer review of this work. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


METHODS 


Fish community time-series data. Long-term time-series data of the fish 
community were obtained by underwater direct visual census conducted approxi- 
mately once every two weeks along the coast of the Maizuru Fishery Research 
Station of Kyoto University (Nagahama, Maizuru: 35° 28’ N, 135° 22’ E) from 
1 January 2002 to 2 April 2014 (285 time points during approximately 12 years)”°. 
This high-frequency census enables the detection of short-term interspecific 
interactions. The area was within 50m of the shore and at a water depth of 1-10 m. 
A 600-m visual transect line, composed of three parts, was set: each transect line 
was 200 m long with a 2-m-width survey area. Each transect included a rocky 
reef, brown algae macrophyte and filamentous epiphyte vegetation, live oysters 
(Crassostrea gigas) and their shells, a sandy or muddy silt bottom and an artificial 
vertical structure that functioned as a fish reef. The vegetation in the area was 
dominated by Sargassum tortile and Sargassum thunbergii on the rocky substrate 
and patches of Zostera marina on the shallow (1-2 m) sandy bottom substrate. 

Species and sizes of individual fish observed within 1 m of each census line (thus 
triplicated in the survey area) were recorded on waterproof paper. Each census was 
conducted on a sunny day and commenced around 12:00 with high tide of 2-3 h. 
The census was undertaken by diving using scuba equipment. Water temperature 
was measured near the surface and at the deepest point of the census line (10-m 
depth) during the diving. Visibility ranged from 1 to 15 m but was normally 3-5 m. 
Daily observations at 10:00 revealed that surface water temperature and salinity 
in the area ranged from 1.2 to 30.8°C and from 4.14 to 34.09 parts per thousand, 
respectively. The mean +s.d. surface salinity was 30.0 + 2.9 (n= 1,753) and did not 
show clear seasonality. Importantly, the same scientist conducted this field survey 
throughout the 12-year research term. Thus, inconsistency in fish identification 
and counting is diminished in the time series. 

We selected dominant fish species (that is, with a total observation count that 

was larger than 1,000) for analyses, because rare species that were not observed 
during most of the census term (that is, with many zero values) were not suitable for 
the time-series analysis. We used time series of 14 fish species and 1 jellyfish species 
(Extended Data Fig. 2). Jellyfish data were included because jellyfish are abun- 
dant in this region, and are thought to have notable influences on the community 
dynamics of fishes’. Including less abundant species in the analyses does not 
change the conclusion (see Extended Data Fig. 4). Before the analyses, the time 
series were normalized to unit mean and variance”*. 
Convergent cross mapping. Convergent cross mapping (CCM) was performed 
to determine the causal relationships among the 15 dominant species in Maizuru 
Bay. CCM is based on Takens’s theorem for nonlinear dynamical systems. For 
multi-variable dynamical systems in which only some of variables are observable, 
Takens’s theorem?”—with several extensions (for example, ref. 28 and references 
therein)—proves that it is possible to represent the system dynamics in a state space 
by substituting time lags of the observable variables for the unknown variables. 
The information in the unobserved variables is encoded in the observed time 
series, and a single time series can therefore be used to reconstruct the original 
state space. This gives a time-delayed coordinate representation (or embedding) of 
the system trajectories, and this operation is sometimes referred to as state space 
reconstruction. 

An important consequence of the state space reconstruction theorems”’ is that 
if two variables X and Y are part of the same dynamical system, then the recon- 
structed state spaces of X and Y will topologically represent the same attractor (with 
a one-to-one mapping between the reconstructed attractors of X and Y). Therefore, 
it is possible to predict the current state of variable X using time lags of variable Y. 
We can look for the signature of a cause variable (for example, X) in the time series 
of an effect variable (for example, Y) by testing whether there is a correspond- 
ence between their reconstructed state spaces (that is, cross mapping)’. In this 
study, cross mapping from one variable to another was performed using simplex 
projection®. In the simplex projection, a set of neighbouring points of Y(t) that 
have similar historical processes to those of Y(t) can find their time-corresponding 
points of X. If the time-corresponding set in X is also the neighbouring points 
of X(t), then it is possible to estimate X(t) accurately by cross-mapping. The 
cross-mapping skill (that is, predictability) can be evaluated using the correlation 
coefficient () between cross-map estimates and observations. Significant skill of 
the cross mapping is one necessary condition for the detection of causality (see the 
test for significance in ‘Phase-lock twin surrogate method’). 

In addition, the cross-map skill will increase as the library length (that is, the 
number of points in the reconstructed state space of an effect variable) increases if 
two variables are causally coupled’ (that is, convergence). As the number of points 
in the state space increases (that is, the time series becomes longer), the trajectory 
defining the attractor fills in, which results in closer nearest neighbours and declining 
estimation error (a higher correlation coefficient). Therefore, convergence is 
another necessary condition for the detection of causality. Practical criteria for 
the causality are described later (see ‘Phase-lock twin surrogate method’). 
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CCM results are sensitive to the choice of embedding dimension (that is, how 

many time-lag coordinates are used for state space reconstruction). Therefore, 
the embedding dimension (E) should be carefully determined. In this study, 
Ewas determined by evaluating out-of-sample predictability through trials of dif- 
ferent embedding dimension, using univariate simplex projection® as previously 
described?’. In brief, the value for E that gives maximum predictability was chosen, 
which ensured that E was sufficiently large to capture the dynamics of the system 
without including extraneous dimensions. Predictability can be measured using 
mean absolute error, root mean squared error or correlation (7) between predic- 
tions and observations. To determine the best E value we used mean absolute 
error, as described previously”’. The best E value was examined from 1 to 24, as 
our dataset came from a census conducted once every two weeks (24 points per 
year), which allows for the influence of previous populations (up to one year prior). 
After the determination of the best E value, CCM was applied to the normalized 
abundance time-series data of the dominant fishes. 
Phase-lock twin surrogate method. Most of our data show strong seasonality 
(Extended Data Figs 2, 6), and time-series data with strong seasonality often exhibit 
a high cross-map skill even when there is no causal relationship between variables 
(Extended Data Fig. 3). This means that synchronization driven by seasonality 
can lead to misidentification of causality (that is, false positives). To deal with 
this problem, we developed the phase-lock twin surrogate method, which takes 
seasonality into account. This new method is modified from the twin surrogate 
method", which generates time series that preserve the shape of an attractor but 
exhibit no causal relationship with a target time series. 

The twin surrogate time series is generated by the following steps: (1) construc- 
tion of a recurrence matrix, (2) identification of twin points on the recurrence 
matrix, (3) selection of the starting point of the surrogate, and (4) generation of 
the surrogate time series by switching trajectories at the twin points. In brief, the 
recurrence matrix is constructed using the following equation: 


Rij = O( — | |x) — xG) I) 


in which © denotes the Heaviside function (O(a) is 1 if a is positive and 0 
otherwise), ||-|| indicates the maximum norm, 6 is a predefined threshold and x(i) 
denotes the system state at time i. We initially set 5= 0.125. Results do not change 
significantly for 5 values between 0.05 and 0.20°°. By coding Rij=1 in the matrix 
as black dots and R;;=0 as white dots, we obtained recurrence plots. The recur- 
rence plot contains all topological information about the attractor. In the recur- 
rence plot there can be identical columns; that is, Ry,; = Rx, Vk, called twins. 
These two points are not only neighbours but also share the same neighbourhood. 
Twins are special points of the time series that are indistinguishable in the recur- 
rence plot, but have different pasts and futures in time series. After identifying the 
twins in the recurrence plot, we randomly chose an entry (/) of the original time 
series (x(/)), as the starting point of the surrogate time series, x°(1). Then, we 
followed the trajectory in the state space—x*(1) = x(/), x°(2) = x(/+ 1), 
x°(3) =x(/+ 2) and so on—in order to generate the surrogate. If we suppose that 
entry m of the surrogate x*(m) may be given by entry j of the original time series 
(x(j)), ifx(j) has no twins we set x°(m + 1) =x(j+ 1). If x(k) is a twin of x(j), we set 
(m+ 1) =x(j +1) or (m+ 1) =x(k + 1) with equal probability. This step is iter- 
ated until the surrogate time series reaches the same length as the original time 
series. By performing the above steps, the twin surrogate time series preserves only 
the nonlinearity (that is, the shape of the attractor), but not any of the remaining 
causal influences. 

To take seasonality into account, we added another constraint to the twin 
surrogate method: twins in the recurrence plot must also be from the same season 
(that is, phase-locking). In our dataset, there are 24 points per year; therefore, if 
a reconstructed state represents an observation in early January, only observa- 
tions in early January (from a different year) can be considered for switching of 
the trajectory. With this constraint, the surrogate time series preserves both the 
seasonality and the shape of the attractor (Extended Data Fig. 3b). 

To evaluate the phase-lock twin surrogate method, we generated artificial time 
series with seasonality by using the following equations: 


Xo(t + 1) = Xo(t){4— 3 Xol) + By Yo) 
Yo(t + 1) = Yo(t){3.8 — 3.8 Yo(t) + By Xo(t)} 


X(t+ 1) = Xo(t+ 1) + a, seasonality 


Y( 


™ 


+1) = Y(t + 1) + ay seasonality 


in which (,, and (3,, indicate interspecific interactions, and a, and ay indicate 
the strength of the seasonality. ‘Seasonality’ is defined by a sine curve. In this 
analysis, the time-series length is 288 (that is, equivalent to a 12-year census with 
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24 observations per year). To examine the effects of seasonality on the cross-map 
skill (p) and convergence, we set = 3y,=0 (no causality between X(f) and Y(t)) 
and a, =a,=0.3 (moderate seasonality). We found that even a moderate strength 
of seasonality resulted in a relatively high predictability and convergence of cross- 
map skill (a false positive) (Extended Data Fig. 3a). 

The phase-lock twin surrogate method generates time series with the same non- 
linearity and seasonality (Extended Data Fig. 3b). Intuitively, a.,= ay = 1.0 indicates 
strong seasonality in the time series (Extended Data Fig. 3b and Supplementary 
Information section 6). By changing a, and ay from 0 to 2.0, we examined the 
performance of the new surrogate method under various strengths of seasonality. 
We examined conditions of no interaction between X and Y (G.,= 3,.=0), 
unidirectional interaction from X to Y and bidirectional interactions between X 
and Y (Extended Data Fig. 3e-h). 

Analyses of model simulations show that the phase-lock twin surrogate method 

gives a conservative criteria (that is, low possibility of false positives) for detecting 
causality among time series that exhibit seasonality (Extended Data Fig. 3). In 
this study, the CCM was regarded as significant if the following two criteria were 
satisfied: (1) cross-mapping for the real time series showed higher skill (7) than 
95% confidence intervals of phase-lock twin surrogate data (that is, significant 
predictability), and (2) the difference between the cross-map skill at the smallest 
and largest library sizes (Ap) was larger than 0.1 (that is, convergence). Although 
Ap>0 isa minimum and simpler criterion for convergence, observation error 
can cause fluctuations in p and Ap, and we therefore used a more conservative 
criterion instead. 
The multivariate S-map method. The multivariate S-map method enables quan- 
tification of dynamic (that is, time-varying) interactions”. If we consider a system 
that has E different interacting species, and the state space at time f is given by 
x(t) = {x;(t), x2(t), ..., xe()}, for each target time point ¢*, the S-map method 
produces a local linear model C that predicts the future value x;(¢* + p) from the 
multivariate reconstructed state-space vector x(t*). That is, 


E 
Xy(t* +p) =Cot+ ss Cj x;(t*) 
j=l 


The linear model is fit to the other vectors in the state space. However, points that 
are close to the target point x(t*) are given greater weighting. The model C is the 
singular value decomposition solution to the equation B= A-C, in which B is an 
n-dimensional vector (n is the number of observations) of the weighted future 
values of x;(t;) for each historical point t;, given by 


Bj = w((||x(ti) — x(t*)||)a(tit p) 


A is then the n x E dimensional matrix given by 
Ag = w((|x(ti) — x(t*)||) (40) 


The weighting function w is defined by 
6d 
d)= — => 
w(d) es 4 | 


which is tuned by the nonlinear parameter 0 2 0 and normalized by the average 
distance between x(¢*) and the other historical points, 


d=+Y ||x(t) —x(¢)| 
ni 


The Euclidian distance between two vectors in the E-dimensional state space is 
given by ||x— ||. Note that the model C is separately calculated (and thus poten- 
tially unique) for each time point t. The coefficients of the local linear model (C) are 
a proxy for the interaction strength between variables®; these interaction strengths, 
defined as partial derivatives in a multidimensional state space, quantify the 
population-level interaction between two species and do not assume any particular 
form of interaction, such as mutualism or competition. Instead, one might be able 
to infer the type of interactions after calculating the interaction strength. 
Evaluations of the multivariate S-map method. Here we also show that the 
long-term averaged interaction strength estimated from the multivariate S-map 
method is equivalent to the interaction coefficient, (3, in the following equations 
(Extended Data Fig. 1). These equations provide the explicit system of equations 
for the two-species model: 


X(t 


1) =X(O){r + sx X(O) + By, YO} 


Yt+D= Y(t){ry Sy Y(t) Bey X(t} 


in which r, and s, (for species x) or ry and s, (for species y) represent an intrinsic 
growth rate and a self-regulation term, respectively. /3,, is an effect of Yon X and 
Bry is an effect of X on Y. 

In a unidirectional two-species model, (3, was set to 0 and (3, was set to —0.31. 
Other parameters were set as follows: r.=4, Sy 3, ry =3.1 and sy 3.1. The 
length of the time series was 1,000 and the initial abundances of X and Y were set 
at 0.5. Before the multivariate S-map analysis, the time series were normalized to 
unit mean and variance. We used a fully multivariate embedding, {X(t), Y(t}, to 
reconstruct the attractor. 

Ina bidirectional two-species model, (3, was changed from —0.5 to 0.2, with 
an interval of 0.1. 3,, was changed from —0.5 to 0.25, with an interval of 0.005. 
Other parameters were set as follows: r, = 3.8, sy = —3.8, ry = 3.5 and s, = —3.5. 
The length of the time series was 1,000 and the initial abundances of X and 
Y were set at 0.5. The attractor was reconstructed using a fully multivariate 
embedding. 

To further test the effectiveness of the S-map method, we applied it to experi- 
mental systems in which signs of interactions were known on the basis of biological 
background knowledge about organisms. We applied the S-map method to two 
experimental systems; one was a classic predator-prey system, and the other was 
a more complex rotifer-algae system. 

The data of the classic Paramecium-Didinium protozoan prey-predator 
system have previously been published*! and can also be found at http:// 
robjhyndman.com/tsdldata/data/veilleux.dat (Extended Data Fig. le). A 
previous study identified conditions that produced sustained oscillations 
in predators (Didinium nasutum) and prey (Paramecium aurelia). Initial 
densities of Paramecium and Didinium in medium were 15 and 5 individuals 
per millilitre, respectively. Abundance measurements were taken every 12h. 
The first 10 data points were removed to eliminate transient behaviour in the 
initial period of the experiment. The time series were then normalized to unit 
mean and variance before analysis. 

The data of the rotifer-chlorella system were from an experimental predator- 
prey system*, The predator-prey system consisted of Brachionus calyciflorus 
(an asexually reproducing predatory rotifer) and Chlorella vulgaris (an asexually 
reproducing algal prey) (Extended Data Fig. 1g). One type of algal clone has a 
higher population growth rate, whereas another type of algal clone is more defen- 
sive against rotifer predation (hereafter algae r and algae K, respectively). The 
changes in clonal frequency in the algal population (that is, natural selection in the 
population) were quantified using the allele-specific quantitative PCR technique. 
Detailed experimental protocols were as described**. 

Among the 63 data points, 7 were not available (days 10, 14, 17, 18, 26, 57 and 

59). The missing data were estimated using simple linear interpolation. Before 
quantifying interaction strengths between the species, CCM was performed to 
detect causality. CCM detected causalities between all pairs in the system; thus, 
there are six causal relationships in the system. We then quantified interaction 
strength using the S-map method. The S-map method was performed using full 
multivariate embedding that is, {Rotifer(t), Algae_r(t), Algae_K(t)}. To forecast the 
abundance of the rotifer, algae r and algae K, nonlinear parameters (6) were set as 
0.1, 1.8 and 1.2, respectively. 
Reconstruction of the dynamic interaction matrix of the fish community. CCM 
with the phase-lock twin surrogate method identified 14 causal links among the 
fish species (Fig. 1 and Extended Data Table 1). To approximate the interaction 
matrix (Jacobian matrix) of the fish interaction network at each time point, we 
used the multivariate S-map method®’. 

The estimation of interaction strengths by the multivariate S-map method is 
sensitive to the choice of variables included in state space reconstruction, and 
these variables should therefore be determined carefully. In the analysis of the 
Maizuru fish community, variables included in the state space reconstruction were 
as follows: if species x; is causally influenced by x2 and x3, and if the best E of x; 
is 5 (determined by the simplex projection; see ‘Convergent cross mapping’), the 
state space is reconstructed by {xj(£), x2(t), x3(t), x1(t— 1), x1(t—2)}. That is, the 
number of variables used to reconstruct the attractor must be equal to the best E: 
to fulfil this requirement, the time lags of the target variable were included in the 
embedding space when the number of candidate species was smaller than E. This 
was done to capture fully the dynamics of the system without including extraneous 
dimensions. The best nonlinear parameter (#) was chosen for the multivariate 
embedding on the basis of the mean absolute error, as previously described”. 
In this study, the coefficients of the local linear model for x2(t) and x3(t) were 
regarded as interspecific interaction strengths, and other coefficients (that is, for 
x(t), x1(t— 1) and x;(t— 2)) were not interspecific and thus excluded from the 
calculations for indices of interspecific interactions (see later). 

Dynamic stability of the fish community and indices of interspecific 
interactions. The interaction strength quantified by the multivariate S-map 
method is an approximation of the partial derivative for each time point. Thus, 
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using the multivariate S-map method, population dynamics including time-lag 
effects are described as 


Ox(t+ 1) Ox(t + 1) i Ox,(t + 1) 


A(t+)= Bul x(t) 4 dul) x(t) + EO x(t) +... 
_ Ox(t+ 1) x(t +1) | 
| G1 Balt oo) N+. +Co 


in which % indicates the predicted abundance of species i in the community (% 
for species 1, in this example), and Cp indicates the intercept. Note that the partial 
derivatives and intercept are calculated using the S-map method as described ear- 
lier®. For simplicity, here we describe a case in which %(t + 1) includes only one 
time lag (x(t — 1)), but the following descriptions can easily be extended to cases 
that include more time-lag terms. In a matrix notation, the community dynamics 
are described as 


Xt+D=hX)+h X(t-1)+C () 


in which X indicates the n-dimensional vector of the abundance of n species, J; is 
the m x n-dimensional matrix of partial derivatives (interaction strengths), and 
Cis the n-dimensional vector of the intercepts. If we write the unity matrix, zero 
matrix and zero vector as I, O and O, respectively, then we can describe X(t) as 


X(t) =I X(t) +O X(t-1)+0 (2) 
By combining equations (1) and (2), we get the following: 


X(t+)]_(h h Cc 
=|) ‘| +(5 ©) 


X(t) 


x(t) 
X(t—1) 


If we write 


| a | = Wid, i : =Aand (S| =B, 
X(t—1) O 
Then equation (3) can be written as: 
Wit+)=AW(t) +B (4) 


Assuming that W* is the abundance of the steady state (note that assuming W* 
here does not assume the existence of the local stable equilibrium of the community 
dynamics), equation (4) can be written as 


W*=AW*+B (5) 


(I— A)W*=B (6) 


By substituting equation (5) from equation (4), we get 


W*= A(W(t) — W*) 
For the purpose of convenience, we write W(t+ 1) — W* as Z(t+ 1), and then 
equation (4) can be written as: 


Z(t+1)=AZ(t) 


Therefore, the stability of this system can be examined by investigating eigenvalues 
of the interaction matrix A, which correspond to the Lyapunov exponents. In this 
study, for the purpose of convenience, we describe the local Lyapunov stability 
as the absolute value of the real part of the dominant eigenvalue of the interac- 
tion matrix A, and this stability is called ‘dynamic stability. A dynamic stability 
value of less than 1 indicates that the community tends to recover faster from 
perturbations, if the interspecific interaction strengths (off-diagonal elements in 
the interaction matrix J;) and self-regulation effects (diagonal elements in the inter- 
action matrix J; and J>) are kept constant. Although our analysis may be analogous 
to local stability analyses, the calculation of dynamic stability does not require an 
assumption of a locally stable equilibrium: because the multivariate S-map method 
actually generates a state-dependent (and hence time-varying) interaction matrix, 
it is applicable to non-equilibrium systems and reflects whether the trajectories at 
any particular time are converging or diverging®. In addition, we computed several 
properties of the interaction network structure including mean, weak interaction 
index (indicated by the median interaction strength:maximum interaction strength 
ratio), standard deviations and skewness, using the absolute value of off-diagonal 
elements in J). Previous theoretical studies have suggested that these indices are 
potential drivers of community stability!1>1°, 

Sensitivity of the dynamic stability to the inclusion of subdominant species. 
In the main analyses, we included only a subset of the whole community; only 


LETTER 


dominant species were selected. To test the robustness of our analysis to the inclu- 
sion of a less abundant species, we performed a sensitivity analysis to look at how 
the dynamic stability is affected. In this sensitivity analysis, four subdominant 
species were chosen. These were Ditrema temminckii, Pseudoblennius cottoides, 
Takifugu niphobles and Takifugu poecilonotus. The total abundance of each of 
these species during the census term is between 1,000 and 100; furthermore, 
they do not show too many ‘0’ observations during the census period. Each of 
the subdominant species was added separately in reconstructing an interaction 
network. The same procedure described in the above sections was applied to the 
network reconstruction, quantification of interaction strength and calculation 
of the dynamic stability. 

Sensitivity of the dynamic stability to observation errors. To test the sensitivity of 
the dynamic stability to observation errors in the visual census data, we calculated 
the dynamic stability after artificial observation errors were added to the visual 
census data. In this analysis, we assumed that observation errors are proportional to 
the number of observed fish individuals. More specifically, observation errors were 
added using the following R script: error <- rnorm(1, mean=0, sd=error_percent * 
data). Error_percent represents the magnitude of observation errors added to the 
original value, and data indicates the number of fish individuals at a particular 
time point. 

Calculations of coefficient of variation. To compare the dynamic stability in this 
study and coefficient of variation (CV) of the fish community, we calculated mean 
values of CV of fish populations (that is, 15 fish species). First, we determined a 
target time point and selected three time points before and after the target point, 
which generated a three-month window. Then, we calculated the mean value, 
standard deviations and CV for each species within the window. The mean CV was 
calculated at each window by taking the average of the CVs of the 15 fish species. 
This procedure was repeated throughout the census period. 

Relationships between dynamic stability and other variables. Using CCM and 
the phase-lock twin surrogate method, we investigated the following possible causal 
drivers of dynamic stability: (1) measures of the strength of interspecific interac- 
tions such as mean interaction strength and weak interaction index (Fig. 4a, b); 
(2) diversity indices (such as species richness and Simpson's diversity index) (Fig. 4c 
and Extended Data Fig. 6); (3) water temperature and total fish abundance 
(Extended Data Fig. 6); and (4) measures of the distribution of interspecific inter- 
actions such as s.d. and skewness (Extended Data Fig. 6). 

Abundance-based stability index. An alternative and more intuitively under- 
standable measure of stability of the community dynamics would be the 
Euclidean distance of the species abundance between time point f and f+ 1, 
||W(t+ 1) — W(t)||. Here we show that this abundance-based stability index is 
directly related to the eigenvalue-based stability (that is, dynamic stability). From 
equation (4), the abundance-based stability can be written as 


W(t+1)— W(t) =(A—DW(t) +B (7) 
Using equation (6), equation (7) can be written as 


W(t+1) 


W(t) =(A—DW(t) +(1— A)W* 


W(t+ 1) — W(t) = (A— D(W() — W*) (8) 


Equation (8) indicates that the abundance-based stability can be expressed as the 
product of (A — I) and (W(t) — W*); this (W(t) — W*) describes the difference 
between the steady state and the abundance at time t. It is important to note that 
assuming W* does not assume the existence of the local stable equilibrium of 
the community dynamics. Equation (8) indicates how the difference between the 
steady state and the abundance at time ¢ will be amplified in the next time step. In 
other words, (W(t + 1)- W(f)), (A-J) and (W(t) - W*) indicate the community- 
level fluctuation, the potential to change the abundance of each population 
and how the present state differs from the steady state, respectively. The inter- 
specific interactions caused an abundance-based stability index (W(t+ 1)- W(t) 
(Extended Data Fig. 7), which suggests that interspecific interactions also drive 
fluctuations in the realized population abundance. 

Effects of interspecific interactions and species diversity on the dynamic 
stability. To determine how interspecific interactions and species diversity 
affect dynamic stability, we quantified the causal effect using the multivariate 
S-map method (Fig. 4d), which is described above. Before the analysis, all data 
were normalized. When using the multivariate S-map method, we used fully 
multivariate embedding (reconstructed state space = {DynamicStability(¢), 
MeanInteractionStrength(t), WeakInteractionIndex(t), Simpson’sDiversity(t), 
and time-lag terms}), and the partial derivatives were then calculated. 
Computation. Simplex projection, the S-map method and CCM were performed 
using the ‘rEDM’ package (version 0.2.4)!”, and all statistical analyses were 
performed in the statistical environment R v.3.2.1°*. 
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Code availability. The R scripts used for the main analyses are available at https:// 
doi.org/10.5281/zenodo.1039387. 

Data availability. Time-series data of the Maizuru fish community are available 
at https://doi.org/10.5281/zenodo.1039387. All other data are available from the 
corresponding author(s) upon reasonable request. 
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Extended Data Figure 1 | Effectiveness of the S-map method examined 
in two-species model systems and laboratory experiment systems. 

a, Illustration of the unidirectional two-species model system. X has 

a direct influence on Y, but Y does not have an influence on X. b, An 
example of the dynamics of the two-species system. The interaction 
strength from X to Y was set at —0.31 in this example. c, The estimation 
of interaction strength using the S-map method. True interaction strength 
is —0.31, whereas the mean of the S-map coefficients is —0.309. The length 
of the time series used for the analysis was 1,000. d, Test of the S-map 
method in a two-species bidirectional system. Interaction strength from 
Y to X was fixed for each panel (as denoted in the header of each panel), 
and interaction strength from X to Y was changed (x axis). The length of 


the time series used for each analysis was 1,000 (see Methods). Dashed 
lines indicate 1:1 lines. Dynamics that show strong linearity (for example, 
limit cycle and equilibrium) were excluded from the analysis; that is, 
regions around the origin were excluded. e, Population dynamics of 
Didinium (predator) and Paramecium (prey). f, Estimation of interaction 
strength between Didinium and Paramecium. g, Population dynamics 

of the rotifer (predator) and two types of algae (prey). Inset illustrates 

the three-species experimental system. R, Ar and AK indicate rotifers, 
r-strategy algae and K-strategy algae, respectively. Units for the y axis are 
10° cells per ml for the algae, and 10 individual females per ml for the 
rotifer. h, i, Estimation of pair-wise interaction strength among r-strategy 
algae, K-strategy algae and rotifers. 
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Extended Data Figure 2 | Time series of dominant fish species and jellyfish in Maizuru Bay in Japan. During a 12-year census (2002-2014), 285 
surveys were conducted. The width of the grey region corresponds to a 1-year interval that runs from January to December (24 observations per year). 
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Extended Data Figure 3 | Evaluation of the phase-lock twin surrogate 

method. a, A false high cross-map skill and convergence, owing to 

seasonality. We set 3, = 3),=0 (no causality between X(t) and Y(t) 

and a,=a,=0.3 (moderate seasonality). b, An example of the phase- 

lock twin surrogate time series. The original time series with strong 

seasonality is shown as a black solid line (Y(t); B= —0.3, By. =0, ax= 1.0 

and ay= 1.0). The surrogate time series, with the same seasonality and 
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nonlinearity as the original data, is shown as a solid red line. c—h, Cross- 
map skill (terminal p) and terminal  —95% upper limit; p of 100 surrogate 
data by CCM between X and Y, when X and Y have no interaction (c, d), 
unidirectional interaction (e, f) and bidirectional interaction (g, h). The 


length of the time series used for the evaluation was 288 (equivalent to a 
12-year census with 24 observations per year). 
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Extended Data Figure 4 | See next page for caption. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


Extended Data Figure 4 | Sensitivity to the inclusion of subdominant 
species and observation errors. a—d, Relationship between the dynamic 
stability calculated from the community of 15 dominant species versus 
that of a 16-species community. A subdominant species (D. temminckii (a), 
P. cottoides (b), T. niphobles (c) or T. poecilonotus (d)) was added to 

the community of 15 dominant species, and the dynamic stability was 
calculated by the procedure described in the Methods. Inset shows 

the interaction network structures of the 16-species community. Solid 
black line indicates the 1:1 line. Red circle indicates the newly included 
subdominant species. Blue and red arrows indicate positive and 
negative time-averaged interactions, respectively, associated with the 
subdominant species. Grey arrows and circles indicate the edges and 
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nodes, respectively, of the original community of 15 dominant species. 
e-j, Effects of observation errors on the calculations of the dynamic 
stability. e, Observation errors were added to the original time series (see 
Methods), R? was calculated between the original dynamic stabilities and 
those calculated from the time series with an added error. This procedure 
was repeated 100 times for each error magnitude (%). Midline, box limits, 
whiskers and points indicate median, upper and lower quartiles, 1.5 x 
interquartile range and outliers, respectively (n = 100 for each box). 

f-j, Examples illustrating the relationships between the original dynamic 
stabilities versus those calculated after the addition of 1% (f), 5% (g), 

10% (h), 20% (i) and 30% (j) observation errors. The solid line indicates 
the 1:1 line. The dashed line indicates the dynamic stability = 1.0. 
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Extended Data Figure 5 | Relationship between dynamic stability and (n=56 for stable conditions and n = 203 for unstable conditions). Under 
coefficient of variation of fish abundance. a, Time series of mean values stable conditions (dynamic stability < 1.0), the CV is significantly lower 
of CV. CV was calculated using a moving window (window width = 6 time than it is under unstable conditions (P < 0.0001, two-sided t-test). 

points; 3 months) for population dynamics of each fish species. Mean Midline, box limits, whiskers and points indicate median, upper and lower 
values of CV were then calculated by averaging CV values of the 15 fish quartiles, 1.5 x interquartile range and outliers, respectively. 


species. b, Comparison of CV between stable and unstable periods 
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Extended Data Figure 6 | CCM between dynamic stability and surface 
water temperature, species richness, total fish abundance and the s.d. 
and skewness of the interaction strength distribution. a—c, Time series 
of surface water temperature (a), richness of dominant fish species (b) and 
total abundance of dominant fish species (c). The width of the grey region 
corresponds to a 1-year interval (24 observations per year). d-f, Results of 
CCM analysis between dynamic stability and surface water temperature (d), 
species richness (e) and total fish abundance (f). g—h, Results of CCM 
between the dynamic stability and s.d. of interaction strength (g) and 
skewness of the interaction strength distribution (h). Dark solid lines 
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indicate cross-map skill (p) from dynamic stability to another variable. 
Shaded regions indicate 95% confidence intervals of 100 surrogate time 
series. Significant cross-map skills (g) are highlighted in red (d-h). 

i, j, Correlations between median:maximum interaction strength (IS) 
(weak interaction index) and s.d. of interaction strength (i) and the 
skewness (j) (1 = 261 for each panel). The dynamic stability is indicated 
in blue. The weak interaction index and s.d. and skewness of interaction 
strength were predominantly linearly correlated, which suggests that the 
s.d. and skewness of interaction strength are alternative representations of 
the weak interaction index in our data. 
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Extended Data Figure 7 | Abundance-based stability index of the fish 
community. a, Temporal dynamics of the abundance-based stability 
index. Euclidean distance between W(t+ 1) and W(t) was calculated 

(see Methods for the definition of W(f)). Note that the abundance of each 
fish species was standardized before calculating the Euclidean distance. 
b-g, Results of CCM between the abundance-based stability and 
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interspecific interactions, species richness, diversity and surface water 
temperature. Dark solid lines indicate cross-map skill (p) from the 
abundance-based stability to another variable. Shaded regions indicate 
95% confidence intervals of 100 surrogate time series. Significant cross- 
map skills (p) are highlighted in red. 
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Extended Data Table 1 | Result of CCM for the dominant fish species in Maizuru Bay 


Jellyfish Engraulis Plotosus Sebastes Trachurus Girella Pseudolabrus Parajulis 
Jellyfish - -0.003 0.030 0.012 0.131 0.169 0.035 0.097 
Engraulis japonicus -0.005 - -0.003 0.046 0.067 0.125 0.144 -0.030 
Plotosus japonicus 0.077 -0.030 - 0.197 0.263 0.028 0.333 0.195 
Sebastes cheni 0.044 -0.022 0.112 - 0.311 0.339 0.269 0.334 
Trachurus japonicus 0.391 0.133 0.309 0.269 - 0.481 0.447 0.552 
Girella punctata 0.168 0.090 0.243 0.191 0.421 - 0.296 0.358 
Pseudolabrus sieboldi 0.418 -0.030 0.417 0.342 0.643 0.546 - 0.594 
Parajulis poecilepterus 0.188 0.029 0.071 0.570 0.370 0.425 0.359 - 
Halichoeres tenuispinis 0.215 0.110 0.072 0.360 0.394 0.324 0.319 0.688 
Chaenogobius gulosus 0.101 -0.058 0.049 -0.026 0.043 0.037 0.071 0.023 
Pterogobius zonoleucus 0.009 -0.086 0.062 0.087 0.287 0.138 0.408 -0.008 
Tridentiger trigonocephalus 0.022 -0.131 -0.002 0.075 0.142 0.103 -0.029 0.186 
Siganus fuscescens 0.078 0.018 0.175 -0.018 0.215 0.180 0.180 0.328 
Sphyraena pinguis 0.048 -0.045 0.004 -0.010 0.340 0.154 0.028 0.153 
Rudarius ercodes 0.092 0.056 0.069 -0.005 0.411 0.149 0.284 0.117 
Halichoeres Chaenogobius Pterogobius Tridentiger Siganus Sphyraena Rudarius 
Jellyfish 0.019 0.019 0.032 -0.025 0.064 0.032 0.077 
Engraulis japonicus -0.043 -0.061 -0.042 0.141 -0.059 -0.024 0.145 
Plotosus japonicus 0.019 0.064 0.053 0.337 0.047 -0.010 0.123 
Sebastes cheni 0.285 0.117 0.102 0.127 0.229 0.003 0.257 
Trachurus japonicus 0.415 0.190 0.296 0.215 0.266 0.465 0.499 
Girella punctata 0.229 -0.022 0.353 0.105 0.244 0.047 0.224 
Pseudolabrus sieboldi 0.567 0.336 0.371 0.339 0.401 0.204 0.530 
Parajulis poecilepterus 0.656 0.119 0.200 0.270 0.336 -0.042 0.313 
Halichoeres tenuispinis - 0.209 0.236 0.182 0.312 -0.027 0.374 
Chaenogobius gulosus 0.005 - 0.005 0.044 0.069 -0.027 0.091 
Pterogobius zonoleucus 0.095 0.243 - 0.002 0.086 -0.047 0.551 
Tridentiger trigonocephalus 0.232 0.325 0.125 - 0.084 0.130 0.241 
Siganus fuscescens 0.209 -0.014 -0.016 0.133 - 0.014 0.100 
Sphyraena pinguis 0.014 0.090 0.003 0.104 -0.021 - 0.153 
Rudarius ercodes 0.148 0.131 0.081 0.061 0.028 0.127 - 


The significance of the CCM was judged using two criteria: (1) the terminal p is higher than the 95% confidence intervals of phase-lock twin surrogate, and (2) the initial p minus the terminal p (Ap) is 
larger than 0.1. 
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Extended Data Table 2 | Dimensionality and nonlinearity of the stability index and interaction strength 


Index or interspecific interactions Best embedding dimension (£) Best nonlinear parameter (0) 


Dynamic stability > 24 2 
Individual interspecific interactions 

Cause species Effect species 

Trachurus japonicus Aurelia sp. 2 1.3 
Parajulis poecilepterus Sebastes cheni 10 0.8 
Sphyraena pinguis Trachurus japonicus 12 0.8 
Rudarius ercodes Trachurus japonicus az 1.9 
Plotosus japonicus Pseudolabrus sieboldi > 24 0.8 
Pterogobius zonoleucus Pseudolabrus sieboldi 6 2.4 
Halichoeres tenuispinis Parajulis poecilepterus 5 1.5 
Siganus fuscescens Parajulis poecilepterus 5 2.3 
Pterogobius zonoleucus Chaenogobius gulosus 6 1.2 
Tridentiger trigonocephalus Chaenogobius gulosus 7 0 
Girella punctata Pterogobius zonoleucus 4 1.2 
Plotosus japonicus Tridentiger trigonocephalus > 24 0 
Trachurus japonicus Sphyraena pinguis 8 1.6 
Pterogobius zonoleucus Rudarius ercodes 3 0 


Embedding dimension was examined from E= 1 to 24, and the best nonlinear parameter was examined from 6=0 to 10. 
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Cognitive performance is linked to group size and 
affects fitness in Australian magpies 


Benjamin J. Ashton!, Amanda R. Ridley, Emily K. Edwards! & Alex Thornton? 


The social intelligence hypothesis states that the demands of social 
life drive cognitive evolution!°. This idea receives support from 
comparative studies that link variation in group size or mating 
systems with cognitive and neuroanatomical differences across 
species*-’, but findings are contradictory and contentious*"°. 
To understand the cognitive consequences of sociality, it is also 
important to investigate social variation within species. Here we 
show that in wild, cooperatively breeding Australian magpies, 
individuals that live in large groups show increased cognitive 
performance, which is linked to increased reproductive success. 
Individual performance was highly correlated across four cognitive 
tasks, indicating a ‘general intelligence factor’ that underlies 
cognitive performance. Repeated cognitive testing of juveniles at 
different ages showed that the correlation between group size and 
cognition emerged in early life, suggesting that living in larger 
groups promotes cognitive development. Furthermore, we found 
a positive association between the task performance of females and 
three indicators of reproductive success, thus identifying a selective 
benefit of greater cognitive performance. Together, these results 
provide intraspecific evidence that sociality can shape cognitive 
development and evolution. 

The social environment is commonly assumed to generate impor- 
tant cognitive challenges. According to the social intelligence (or social 
brain) hypothesis, these challenges, including the need to form and 
maintain social bonds, track third-party relationships and anticipate the 
actions of others, are the central drivers of cognitive evolution! 3. This 
argument receives widespread support from studies that link variation 
in social factors, such as group size or mating systems, with differences 
in cognitive performance or neuroanatomy across species of birds and 
mammals (for example, see refs 3-6). However, comparative analyses 
are subject to ecological and phylogenetic confounding effects, and 
have yielded conflicting results, with recent work calling into question 
the importance of social factors*"!°. To understand the role of sociality 
in cognitive evolution, it is critical to examine the causes and fitness 
consequences of cognitive variation within species'!”. 

For species that live in stable social groups, within-population 
variation in group size could generate differences in information- 
processing demands and so influence the expression of cognitive 
traits'*, Measurements of brain structure correlate with group size 
in humans, captive cichlids (Neolamprologus pulcher) and captive 
macaques (Macaca mulatta)'*-'5, but the relationship between group 
size and cognition in wild animals is unknown. Furthermore, the 
potential for group-size-dependent cognitive traits to come under 
selection is not understood, as their fitness consequences have not 
been investigated. To address these critical gaps in our knowledge, we 
examined whether group size predicts individual variation in cognitive 
performance (controlling for morphological, nutritional and behavi- 
oural factors) within a population of wild, cooperatively breeding 
Australian magpies (Western Australian subspecies, Cracticus tibicen 
dorsalis, also known as Gymnorhina tibicen dorsalis). We quantified 


individual cognitive performance in 56 birds from 14 groups, which 
ranged in size from 3 to 12 individuals, using a variety of cognitive 
tasks designed to measure inhibitory control (the ability to inhibit 
prepotent responses), associative learning, reversal learning and spatial 
memory (Extended Data Fig. 1). These four domain-general cognitive 
processes are thought to have an important role in a range of fitness- 
related behaviours in both social and asocial contexts!!"° (see Methods 
for details). 

Group size was the strongest predictor of adult performance across 
all four tasks (Supplementary Tables 1-4), with individuals from larger 
groups performing better than those from smaller groups (Fig. 1). 
Individual performance was significantly positively correlated across 
all four tasks (Supplementary Table 5), suggestive of an underlying 
general intelligence factor akin to what has been reported in human 
psychometric studies'”. A principal component analysis revealed that 
performance in all four tasks positively contributed to the first principal 
component (PC1; eigenvalue >1). This component (referred to hereafter 
as ‘general cognitive performance’) accounted for 64.6% of the total 
variance in task performance (Extended Data Table 1), a substantially 
higher proportion than has previously been shown for cognitive tasks 
in other species'*?*. Group size was also the strongest predictor of PC1 
(Fig. 2 and Supplementary Table 6). To confirm that our tasks provided 
robust measurements of individual cognitive performance, we ran a 
second set of cognitive tasks two weeks later using causally identical, 
but visually distinct, tasks (see Methods). Individual performance 
was highly repeatable in all four tasks: inhibitory control (r= 0.806, 
P<0.0001), associative learning (r=0.97, P< 0.0001), reversal learning 
(r=0.975, P< 0.0001) and spatial memory (r= 0.932, P< 0.0001) 
(Extended Data Table 2). 

To examine the development of the relationship between group size 
and cognition, we repeatedly tested juveniles at 100, 200 and 300 days 
after fledging. There was no evidence of general cognitive performance 
at 100 days after fledging (see Supplementary Discussion); however, 
much like adults, there was strong evidence for general cognitive per- 
formance at 200 (PC1 accounted for over 64% of total variance in task 
performance; Extended Data Table 3 and Supplementary Table 7) and 
300 days after fledging (more than 80% of total variance explained by 
PC1; Extended Data Table 4 and Supplementary Table 8). There was 
no relationship between group size and cognitive performance at 100 
days (Supplementary Tables 9, 10), but PC1 was strongly positively 
correlated with group size at 200 and 300 days (Fig. 3, Supplementary 
Tables 11, 12; see Supplementary Discussion for discussion of influen- 
tial data points). When analysed longitudinally, an interaction between 
age tested and group size was the best predictor of cognitive perfor- 
mance (Extended Data Fig. 2 and Supplementary Tables 13-18). 

The emergence of a positive association between group size and 
cognitive performance through early life supports the possibil- 
ity that living in large groups helps to drive cognitive development. 
Manipulations of group size would be required to demonstrate an 
unequivocal causal effect, which in wild populations would lead to 
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Figure 1 | The relationship between group size and cognition. 

a-d, The relationship between group size and cognitive performance in 
an inhibitory control task (a; n = 56 individuals), associative learning 
task (b; n = 48 individuals), reversal learning task (c; n = 48 individuals) 


strong logistical and ethical challenges (see Supplementary Discussion). 
However, our analyses allow us to address key alternative explanations. 
First, the increased cognitive performance of birds in large groups is 
unlikely to be explained by reduced nutritional constraints on cog- 
nitive development”’, because we found no effect of group size on 
offspring provisioning rates (Supplementary Table 19), and no rela- 
tionship between body size and cognitive performance in either adults 
or juveniles (Supplementary Tables 1-4, 9-12). We also found no 
relationship between foraging efficiency and cognitive performance 
in adults (Supplementary Tables 1-4; foraging efficiency data were 
not available for juveniles). Second, the positive effects of group size 
cannot result from a reduced need for vigilance or reduced neopho- 
bia (the fear of something new): we recorded no anti-predator behay- 
iour during any task presentations and neophobia was unrelated to 
performance on any tasks, except juveniles’ performance on the spa- 
tial memory task at 100 days after fledging (adults: Supplementary 
Tables 1-4; juveniles: Supplementary Tables 9-12). There was also no 
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Figure 2 | The relationship between group size and general cognitive 
performance. Individual measurements of general cognitive performance 
derived from principal components analysis. n = 46 individuals. 
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and spatial memory task (d; n = 49 individuals). Lines represent best fit. 
Performance is measured as either the number of trials taken to succeed 
with the task, or the number of locations searched, so lower scores indicate 
better performance. 


relationship between group size and the time that test subjects spent 
interacting with tasks (see Supplementary Discussion). Third, a link 
between cognitive performance and group size could potentially arise 
if magpies preferentially joined groups containing individuals with 
similar traits, but life-history data collected over more than four years 
provided no evidence of such social assortment (see Supplementary 
Discussion). Moreover, we found a clear difference in the frequency 
distribution of cognitive phenotypes between small and large groups 
(Extended Data Fig. 3), so it is not simply the case that larger groups 
have a wider distribution of cognitive phenotypes, and are therefore 
more likely to contain some high-performing individuals by chance. 
Instead, we propose that, as suggested by captive studies'*’>, living in 
larger groups presents wild animals with information-processing chal- 
lenges that promote the development of cognitive traits. Determining 
precisely what those challenges are is a priority for future research. An 
important next step will be to determine whether individual cognitive 
development is specifically linked to the quantity and quality of their 
relationships within their social networks, as might be expected if the 
need to establish and maintain multiple relationships within groups 
places cognitive demands on individuals’. 

To determine whether the group-size-dependent cognitive varia- 
tion that we have identified may be subject to selection, we examined 
the relationship between individual cognitive performance and three 
measures of reproductive success. General intelligence has been linked 
to fitness-related traits in humans”‘, but few studies have examined 
the fitness consequences of cognitive variation in wild animals", and 
the two studies that have used rigorous psychological tests found no 
effects'®”°. In our magpie population, exceptionally high rates of extra- 
group paternity*® mean that we were only able to reliably identify the 
mother of the brood (female reproductive skew in our population is 
low, and all females attempt to breed). Variation in female reproductive 
success was strongly linked to cognitive performance: general cognitive 
performance and foraging efficiency were the best predictors of the 
average number of hatched clutches per female per year (Fig. 4a, b and 
Supplementary Table 20), and general cognitive performance was the 
best predictor of the average number of fledglings produced and the 
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Figure 3 | The relationship between general cognitive performance 
and group size in juveniles. a, b, The relationship between general 
cognitive performance and group size at 200 days after fledging 


average number of fledglings surviving to independence per female per 
year (Fig. 4c, d and Supplementary Tables 21, 22). These effects were 
independent of group size (Supplementary Tables 20-22), indicating 
that fitness benefits arise as a direct consequence of increased cognitive 
performance and are not simply the result of the non-cognitive advan- 
tages of living in larger groups. These results provide the evidence of 
a potential selective benefit of high levels of general cognitive perfor- 
mance in a wild population of non-human animals. Precisely how these 
benefits arise, and whether increased cognitive performance incurs 
any costs”’, has yet to be determined. General cognitive performance 
and foraging efficiency are not correlated in female magpies (r= 0.06, 
P=0.791, n= 22), but it is possible that cognitively adept females 
may boost their reproductive success through improvements not in 
the quantity, but in the quality or variety of food given to offspring”®. 
Additional, non-mutually exclusive explanations for the relationship 
between cognition and reproductive success could include enhanced 
abilities to defend young by avoiding inter- and intraspecific conflict”, 
or heritable cognitive abilities that promote offspring survival*. It is 
also possible that the fitness benefits of cognitive performance may 
account for the relation between group size and cognition, if females 
with increased cognitive performance produce large numbers of 
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(a; n= 15 individuals) and 300 days after fledging (b; n = 10 individuals). 
General cognitive performance could not be computed at 100 days after 
fledging. 


cognitively adept offspring. However, this explanation is unlikely, given 
that group size is stable over time (see Methods), and the extraordinar- 
ily high rates of extra-group paternity” are likely to preclude substantial 
genetic differentiation between groups. 

Since its inception, the social intelligence hypothesis has focused 
on cognitive differences between species that result from selection in 
response to the challenges of social life. Our results indicate that social 
factors can also have developmental effects on cognition within species, 
with important consequences for individual fitness. In summary, we 
have shown that wild Australian magpies living in larger groups show 
increased cognitive performance, which is associated with increased 
reproductive success. The association between group size and cognition 
emerges through early life and cannot be explained by food intake, 
body size, neophobia, attention to tasks or social assortment. Although 
we cannot rule out the possibility that some other, unmeasured 
factor could have a role in driving the relationship, our findings strongly 
suggest that the social environment has developmental effects on fun- 
damental, domain-general cognitive traits. Furthermore, we provide 
rare evidence that cognitive performance provides benefits for female 
reproductive success. Recent comparative studies have brought into 
question the notion that variation in social structure drives cognitive 
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per year (b), general cognitive performance and the average number of 
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evolution®!°. However, our work highlights the critical importance of 
considering intraspecific variation, which is typically overlooked by 
comparative analyses. Together, our results point to a major role for 
the social environment in driving both the development and evolution 
of cognitive traits. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data reporting. No statistical methods were used to predetermine sample size. 
The experiments were not randomized and the investigators were not blinded to 
allocation during experiments and outcome assessment. 

Study site and species. The study took place in Guildford, Western Australia, 
between September 2013 and February 2016. The study population consists of 14 
groups of ringed, habituated Australian magpies (Western Australian subspecies 
Cracticus tibicen dorsalis), with groups ranging in size from 3 to 12 individuals 
(for composition of study population, see Supplementary Table 23). The Western 
Australian subspecies breeds cooperatively and lives in territorial groups, in which 
the number of adults remains stable (individuals within our study population have 
remained in the same group since research commenced in 2013, and there have 
been no recordings of ringed birds moving between groups)”*"". Individuals 
exhibit a range of cooperative behaviours such as territory defence and allopa- 
rental care. Reproductive skew among females is very low, with all adult females 
typically attempting to breed each year*’, but extra-group paternity is the highest 
recorded for any bird species (>82%)”*, indicating high gene flow between groups. 
All of the group territories for our study population are located in urban parklands. 
Although individuals have access to food from anthropogenic sources, it is worth 
noting that all territories cover similar habitats and none contain dumps or landfills 
that could provide a glut of food sources. 

The majority of birds within our study population are colour-ringed and habit- 
uated to close human observation, allowing us to present cognitive tests to most 
individuals. Individuals are trained to hop onto electronic top-pan scales in return 
for a crumb of mozzarella cheese, allowing us to collect daily records of individual 
body mass. Mozzarella was also used as the food reward in the cognitive tests. 
Weekly behavioural focal follows are carried out on all individuals in the study 
population*’, from which foraging efficiency is calculated (defined as the mass of 
food (in grams), caught per foraging minute; biomass of food items was calculated 
using ref. 33). 

Adult cognitive test battery. We carried out a series of cognitive tests on 56 adult 
Australian magpies. The battery consisted of four tasks designed to measure inhibi- 
tory control, associative learning, reversal learning and spatial memory (Extended 
Data Fig. la-c). All individuals were tested on the tasks in this order. We chose 
these tasks, because (i) they target well-understood and widely studied cognitive 
traits spanning cognitive domains!!”°** and (ii) they are likely to be highly ecologi- 
cally relevant: spatial memory is likely to be important in remembering locations of 
resources and territory boundaries*, while associative and reversal learning enable 
the acquisition and flexible readjustment of predictive contingencies between cues 
in the environment, including learning from conspecifics’ behaviour!!3476%7, 
Finally, inhibitory control, the ability to inhibit prepotent responses, has been 
implicated in adaptive decision-making in both social and asocial contexts'®”>**. 
Inhibitory control. To quantify individuals ability to inhibit ineffective prepotent 
responses towards food, we presented individuals with a detour reaching task”. 
This consisted of a transparent open-ended cylinder (13 cm length, 5 cm diameter; 
Extended Data Figs 1a, 4a) in which a food reward was placed in the centre. Test 
subjects were presented with the task such that the open ends of the cylinder were 
facing away from the individual's direction of gaze. A trial was deemed successful 
if the test subject inhibited the prepotent response of pecking the transparent 
cylinder, and detoured around to the open ends of the cylinder to gain access 
to the food reward. Once an individual successfully detoured to the open ends 
of the cylinder without pecking the transparent walls three times in a row, it was 
considered to have succeeded at the inhibitory control task. The number of trials 
taken to succeed was the measure of success. Trials were carried out at one-minute 
intervals with a maximum of 10 trials, and when possible, all trials were carried 
out on the same day. Individuals that failed to pass were assigned the maximum 
score of 10 for statistical analyses. Other studies using the detour reaching task 
commonly include a training phase in which test subjects are presented with an 
opaque tube before being exposed to the transparent tube (for example, ref. 25). 
We did not include the opaque phase in our study, because it generates difficulties 
in interpretation: success in the transparent condition could be linked to inhibitory 
control, or could result from the continued application of a learned rule: pecking at 
the open ends of the cylinder was rewarded in the opaque condition, so individuals 
may persist with this behaviour in the transparent condition. 

Associative learning. To test associative learning, we used a colour-discrimination 
task consisting of a wooden foraging grid (31 x 9 x 4cm) containing two wells 
(3.5cm diameter, 2.5cm deep; Extended Data Fig. 1b). The presence of only two 
wells allowed experimental trials to be carried out quickly, reducing the chance of 
non-focal birds detecting and approaching the task. The wells were covered with 
PVC lids that fitted exactly into the wells, and were held in place by elastic bands 
that were threaded through drilled holes in the lids and fastened to either side 
of the well (Extended Data Fig. 5). This created an axis on which the lids could 
swivel when pecked. Birds were first trained to search the wells using a shaping 
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procedure similar to what is described in ref. 39: magpies could gain access to a 
food reward (a small amount of grated mozzarella cheese) by first being exposed 
to the wells without any lids covering them, second with the lids partially covering 
the well, and third with the lid fully covering the well. Lid colour in the training 
phase was yellow, a colour not used in any of the experimental trials. Once a bird 
had successfully searched the wells when fully covered by lids three times in a row, 
it moved onto the experimental trials of the associative learning task. 

During experimental trials, the wells were covered by either a dark-blue or light- 
blue lid. One of these two colours was randomly assigned to be the rewarded colour 
for each of the test subjects. We used dark and light shades of one colour, rather 
than distinct colours (for example, red versus yellow), in order to minimize any 
potential effects of past experience with particular colours on task performance””. 
Following ref. 20, test subjects were allowed to search both wells in the first trial to 
demonstrate that only one of the wells contained a food reward. In all subsequent 
trials, the bird was only allowed to search one well before the task was removed. 
Test subjects had a maximum of one minute to complete the task. There was a 
minimum interval of one minute between trials (mean +s.d.= 1.06 + 0.35 min; 
range = 1-6 min) with a maximum of 50 trials per individual per day; differences in 
inter-trial interval were unrelated to group size (Spearman's correlation, rs = 0.048, 
P=0.121, n=1,027 trials). If the maximum number of trials was reached on one 
day, trials were continued the following day. To ensure that the colour was the cue 
being associated with a food reward, rather than location, the position of the baited 
well was pseudo-randomized and was never on the same side of the foraging grid 
for more than three consecutive trials. Furthermore, both wells were wiped with 
cheese to control for olfactory cues. Following ref. 20, an individual was considered 
to have succeeded at the task when it pecked the rewarded colour in at least 10 
out of 12 consecutive trials (10/12 correct represents a significant deviation from 
random binomial probability; binomial test: P= 0.039). The number of trials 
taken to reach this criterion (including the final 12 trials) was the associative 
learning score. 

Reversal learning. Twenty-four hours after the completion of the associative learn- 
ing task, individuals were tested on a reversal learning task. The same foraging 
grid was presented; the only difference being the colour of the rewarded lid was 
reversed from that of the associative learning task. Otherwise the experimental 
protocol and the criteria for passing were the same as the associative learning task 
described above. 

Spatial memory. The spatial memory task consisted of a wooden foraging grid 
(40 x 36 x 4.5cm), containing eight wells (3.5 cm diameter, 2.5 cm deep). The wells 
were equidistant from one another (6 cm between wells) and were arranged in three 
rows, with the first row containing two wells, the second row four wells and the 
third row two wells (Extended Data Fig. 1c). The wells were covered with light- 
blue lids that were the same as those used in the associative and reversal learning 
experiments, so no training phase was necessary. Following the protocol described 
in ref. 20, the experiment consisted of five phases. One of the eight wells was ran- 
domly chosen to be the baited well, containing a food reward in all phases of the 
experiment. The first phase was a ‘baseline’ trial in which individuals searched the 
foraging grid for the baited well. Once the test subject had located and eaten 
the food reward, the foraging grid was removed. Five minutes after the baseline 
trial, the second ‘training’ phase was carried out, during which the same well was 
baited, and the test subjects had to search for the food reward again. The third and 
fourth phases were test trials in which subjects were presented with the foraging 
grid 24 and 48h after the training phase, respectively. The cumulative number of 
wells searched before locating the rewarded well in the third and fourth phases of 
the experiment was the spatial memory score, thus higher scores indicate worse 
performance. To control for olfactory cues, the foraging grid was presented a fifth 
time as an unrewarded ‘probe trial” (five minutes after the 48-h post-training 
phase trial), in which the grid was rotated 180 degrees, and without a baited well. 
The foraging grid would appear identical to the magpie, but the position of the pre- 
viously baited well would be on the opposite side of the grid compared to the other 
phases of the experiment. If the test subject had remembered the location of the 
rewarded well in the experimental phases, one would predict that it would search 
the well opposite the previously baited well. If the test subjects were using olfactory 
cues to locate the rewarded well, one would predict that the previously baited well 
would be searched first. To investigate whether birds were using olfactory cues to 
locate the food reward a paired t-test was carried out to see whether there was a 
difference between the number of wells searched in the 48-h post-training phase 
trial and the fifth trial (see Supplementary Discussion for results). The number of 
wells searched in the fifth presentation did not count towards the spatial memory 
score. 

To ensure that we tested individual performance, and to control for the poten- 
tially confounding effect of social learning or social interference, all trials were 
carried out in conditions as close as possible to social isolation. This was achieved 
by ensuring that no other birds were within 10m of the bird being tested. This was 
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possible as magpies often forage over 10m away from each other. If another bird 
approached during an experimental trial, the trial was discontinued. To investi- 
gate whether individual performance was affected by social learning, we included 
‘test order’ as an explanatory term in the analyses investigating factors affecting 
performance. This allowed us to verify that individuals tested later within a group 
(who could therefore have had opportunities to observe previous group members 
being tested) did not perform better than those tested earlier. Tasks were placed 
directly in front of the test subjects. Experiments were run between 05:00 and 10:00 
and were recorded live by the observers (B.J.A. and E.K.E.). One observer recorded 
individual performance, while the other recorded neophobia (defined as the time 
elapsed between the test subject first came within 5 m of the apparatus and first 
touching the apparatus), the time spent interacting with the task, and anti-predator 
behaviour within the group. 

Individual consistency in adult performance—repeatability testing. Apparent 
individual differences in cognitive performance in a single round of testing could 
simply result from stochastic variation or extraneous confounding variables!!. To 
determine whether individuals were consistent in their performance, we carried 
out a second test battery two weeks after the first test battery to test the repeatability 
of adult cognitive performance. To ensure that individuals could not simply use 
memory of visual cues from the first round of testing to solve tasks in the second 
round, we changed the visual appearance of each task, while keeping the causal 
structure of the task the same. In the associative and reversal learning task the 
colour of lids was changed to dark green and light green. In the spatial memory 
task, the location of the rewarded well was changed from the first test battery. 
In the inhibitory control task, rather than using an open-ended cylinder, we 
presented food rewards behind a transparent curved wall (30cm length, 10cm 
height; Extended Data Fig. 4d). Other than these changes in the appearance of 
the tasks, the protocol and criteria for passing were exactly the same as the first 
cognitive test battery. 

Juvenile cognitive test battery. Juveniles were presented with a battery of four 
cognitive tasks at three ages: 100, 200 and 300 days after fledging (Extended Data 
Fig. 4). Cognitive testing commenced at 100 days after fledging, because by this 
time, individuals spend most of their time foraging independently*!. The same 
four cognitive traits (inhibitory control, associative learning, reversal learning and 
spatial memory) were quantified at each age by presenting individuals with cogni- 
tive test batteries containing causally identical but visually distinct versions of each 
of the four tasks (Extended Data Fig. 4). This ensured the same cognitive traits were 
tested at each age, while making sure the tasks were not the same in appearance, 
minimizing the potentially confounding effect of memory. 

To quantify inhibitory control at 100 days after fledging, we presented 
individuals with the same detour reaching task used in the adult cognitive test 
battery (Extended Data Figs 1a, 4a), and used the same experimental protocols 
and criteria for succeeding at the task. At 200 days after fledging, rather than 
using a transparent open-ended cylinder, food rewards were presented behind a 
transparent curved wall (32 cm length, 12 cm height; Extended Data Fig. 4d). At 
300 days after fledging, individuals were presented with a detour reaching task con- 
sisting of a transparent ‘umbrella’ the food reward could be accessed by detouring 
underneath the transparent Perspex dome (55cm circumference, 8 cm height; 
Extended Data Fig. 4g). Regardless of the differences in appearance, the criteria 
for passing the inhibitory control tasks at 200 and 300 days after fledging were 
the same as for the first detour reaching task presented at 100 days after fledging. 

Spatial memory was quantified at 100 days after fledging by presenting indi- 
viduals with a wooden foraging grid (40 x 26 x 4cm) containing six wells (3.5cm 
diameter, 2.5cm deep) covered with lids as used for the associative and reversal 
learning tasks (Extended Data Fig. 4c). One of the six wells was randomly assigned 
to be the rewarded location for all phases of the experiment. The spatial memory 
experiment consisted of two phases; first the grid was presented in a baseline trial, 
in which individuals were able to search for the rewarded location. A memory 
trial was carried out five minutes later, during which individuals were presented 
with the foraging grid in the exact same arrangement a second time. The number 
of wells searched before locating the food reward in the memory trial was the 
spatial memory score. At 200 and 300 days after fledging the same experiment was 
carried out, although we ensured that a different well was randomly assigned as 
the rewarded location (Extended Data Fig. 4f, i). 

We were unable to quantify individual performance in associative and reversal 
learning at 100 days after fledging, because individuals took a prohibitive amount of 
trials to complete the tasks (no individuals succeeded within 20 trials). Associative 
learning was quantified at 200 days after fledging by presenting test subjects with a 
wooden foraging grid (41 x 35 x4cm) containing 20 wells (3.5 cm diameter, 2.5cm 
deep), covered with 20 plastic lids; 10 a light-blue colour and 10 a dark-blue colour 
(Extended Data Fig. 4e). Wells covered with lids of one colour were randomly 
assigned to be rewarded for the duration of the trials, when these lids were pecked, 
a food reward could be accessed. Test subjects were considered to have passed the 


associative learning task when they chose the rewarded well in eight out of the 
first nine wells searched; this represents a significant deviation from binomial 
probability (binomial test: P= 0.039). The number of trials taken to reach this 
criterion was the associative learning score. Individuals received a maximum of 
10 trials; those that failed to pass were assigned a score of 10 for statistical analyses. 
At 300 days after fledging rather than light- and dark-blue lids, light- and dark- 
green lids were used (Extended Data Fig. 4h). Other than the change in lid colour, 
all protocols and criteria for passing remained the same. 

At each developmental stage (200 and 300 days after fledging), reversal learning 
was quantified 24h after the successful completion of the associative learning task. 
The protocol and criteria for passing the reversal learning tasks were the same as 
the associative learning task, except that the previously unrewarded colours were 
now rewarded. 

Similar to the cognitive testing carried out on adults, all trials were carried out 

on juveniles in isolation. This was achievable because by 100 days after fledging, 
juveniles are foraging independently. 
Life-history data collection. To obtain measures of reproductive success for 
individual birds, we collected life-history data on the study population over three 
years. This was collected through a combination of behavioural focal follows on 
individuals, brood observations and ad libitum data collected while watching 
the whole group (for details see refs 31, 33). The extensive life-history data- 
base developed from these observations allowed us to determine the number of 
hatched clutches, the number of nestlings that fledged and the number of fledg- 
lings surviving to independence for each adult female in the study population per 
annum. In addition, the behavioural focal observations, brood observations and 
ad libitum data allowed us to quantify the amount of food adults provisioned to 
young. Fledglings were considered to have survived to independence when they 
reached three months after fledging. At this age, magpies forage independently 
and are fed by adults infrequently*!. In addition to these three proxies of fitness, 
we also recorded the number of breeding attempts by females—a breeding attempt 
was considered to have occurred if a female was observed incubating on a nest. The 
mother was assumed to be the bird incubating at the nest (there is no evidence of 
egg-dumping or shared incubation in this subspecies, so there was only ever one 
female incubating a given nest). Groups were visited at least once a week during 
the breeding season, providing accurate measurements of the number of breeding 
attempts made per female, and accurate hatch and fledge dates for all nests. 
Clutches were considered to have hatched when adults started bringing food to 
the nest, or if we could see young in the nest. As many nests were upwards of 20m 
high, we were unable to accurately determine clutch size to use as an additional 
measure of reproductive success. 

All methods were performed in accordance with the guidelines and regulations 

of the University of Western Australia, and were approved by the University of 
Western Australia Animal Ethics Office (RA/3/100/1272). 
Statistical analyses. Adult cognitive performance. To determine the factors influ- 
encing individual variation in cognitive ability, we analysed cognitive performance 
using generalized linear mixed models (GLMM) with either a Poisson distribu- 
tion with a logarithmic link (inhibitory control) or a negative binomial distribu- 
tion with a logarithmic link to account for overdispersion (associative learning, 
reversal learning and spatial memory). Cognitive performance was measured as 
the number of trials taken to pass the task. In addition to the potential cognitive 
demands of living in larger social groups, it is possible that indirect effects of 
group size on energy intake and task attention could generate group-size effects 
on cognitive performance’, We therefore included neophobia (defined as the 
time taken to interact with the task once being within 5 m of it), body mass and 
foraging efficiency as explanatory terms in the analysis, as well as sex, the sex ratio 
of males to females in the group, the order tested within the group and group size. 
Group identity was included as a random term in all models. 

To determine whether body condition (body mass, accounting for skeletal 
size) could explain the variation in cognitive performance, we included mass (in 
grams) and tarsus size (in mm; a common analysis of skeletal size in birds) as 
covariates in an additional analysis on a subset of individuals for which both of 
these morphometric analyses were available (n = 27). Dominance status was not 
included as an explanatory variable, because there is no clear dominance hierarchy 
within magpie groups. Adult age and immigration status were not included as 
explanatory variables, because the fledge date and natal origins of some of the 
adults in our population is unknown (Australian magpies are very long-lived, 
living up to 25 years in the wild’). We note that among the birds for which the 
complete life-history is known (n= 19 individuals), there has been no movement 
between groups. 

We analysed our data using a model selection process; terms were ranked in 
order of their corrected quasi-information criterion (QICc) values (the lowest QICc 
value has the greatest explanatory power‘). If a term was more than two QICc 
units smaller than any other term, then this was judged to explain the observed 
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relationship in the data better than any other term. If there was more than one 
term with AQICc < 2 from the ‘best’ term, had confidence intervals that did not 
intersect zero and explained more variation than the basic model (the model 
containing no predictors, just the constant and the random terms), then model 
averaging was carried out on this top set of models as described in ref. 44. All 
statistical analyses were carried out using IBM SPSS Statistics software (v.22). 

To examine the relationship in performance across tasks, we conducted 
Spearman's rank pairwise correlations between all four tasks. To determine whether 
a general cognitive factor explained cognitive performance across all four tasks, we 
performed a principal component analysis with a varimax rotation. Only principal 
components with an eigenvalue >1 were extracted from the analysis. A general 
intelligence factor has been argued to exist when all four tasks positively contrib- 
uted to the first principal component and explain >30% of total task variance”. 
Following ref. 20, to assess whether the tasks were associated with the first princi- 
pal component by chance, we compared the mean and standard deviation of the 
first component factor weights to the 95% confidence intervals of the means and 
standard deviations of the first component factor weights from 10,000 simula- 
tions. For each simulation, performance within each task was randomized between 
individuals (using the randomizeMatrix function in the picante R package”), a 
principal component analysis was performed, and the mean and standard devi- 
ation of the first component factor weights were obtained. The 95% confidence 
intervals were then calculated from the stored means and standard deviations 
from all the simulations. 

Statistical analyses used to calculate estimates of repeatability in cognitive 

performance between the first and second cognitive test batteries were carried 
out in R (v.3.1.1, http://www.r-project.org) with the rptR package” using a 
linear mixed model repeatability estimate, with a restricted maximum likelihood 
function (REML). 
Juvenile cognitive performance. A series of GLMMs were carried out to determine 
which factors affect cognitive performance in each task. Model selection (using 
the same approach as for analyses on adult cognitive performance) was then 
used to determine the most significant predictors of performance in each of the 
cognitive tasks’, 

At 100 days after fledging, the response terms used were cognitive performance, 
in the detour reaching task this was the number of trials until passed and in the 
spatial memory task it was the number of wells searched. As these were count data, 
GLMMs with a Poisson distribution were used. The relationship between perfor- 
mance in the detour reaching task and the spatial memory task were examined 
using a Spearman rank correlation. At 200 and 300 days after fledging, we found 
evidence of general cognitive performance in juvenile magpies (Extended Data 
Tables 3, 4); this parameter was therefore used as the response term for analyses 
investigating factors affecting cognitive performance at 200 and 300 days after 
fledging. 

Explanatory terms included in the models were neophobia, body mass, the stage 
of the breeding season (early or late), the presence or absence of siblings (from the 
same brood), group size and the sex ratio of adult males to females in the group. We 
were unable to include provisioning rate from adults to fledglings as an explanatory 
term in analyses as these data were only available for a small subset of individuals. 
Group ID was included as a random term in all models. 

Factors affecting performance across all ages were analysed for each of the four 
cognitive traits quantified, using GLMMs. Four separate analyses were carried out, 
with cognitive performance used as the response term. Two additional analyses 
were carried out. First, to determine factors affecting performance across all ages 
for both inhibitory control and spatial memory (associative and reversal learning 
were omitted from this analysis as we only quantified performance at 200 and 
300 days after fledgling for these traits). Second, we investigated factors affecting 
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general cognitive performance measured at 200 and 300 days after fledging. Group 
ID and individual ID were included as random terms. Explanatory terms included 
were those used for the previous analyses. A model-selection approach was used 
to determine the most significant terms affecting performance. 

Relationship between cognitive performance and measures of reproductive success. 
We carried out three separate analyses to determine the factors affecting three 
measures of reproductive success: the average number of hatched clutches per 
year, the average number of nestlings fledged per year and the average number of 
fledglings surviving to independence per year. We carried out GLMMs, with the 
analysis of reproductive success as the response term, and group ID was included 
as a random term. Explanatory terms included in the analyses were body mass, 
foraging efficiency, group size, the sex ratio of the group and general cognitive 
performance. General cognitive performance was used as an explanatory term for 
cognitive performance, because the principal component analysis revealed robust 
evidence for its existence within females (PC1 accounted for >70% of total variance 
in female task performance, Supplementary Table 25). We did not include age, 
because we do not know the exact fledge date for the majority of adult females in 
the population. 

Data availability. The data that support the findings of this study have been 
deposited in the Dryad Digital Repository (https://doi.org/10.5061/dryad.ph3h8). 
Source Data have been provided for Figs 1-4 and Extended Data Figs 2, 3. 


31. Pike, K.N. How much do Helpers help? Variation in Helper Behaviour in the 
Cooperatively Breeding Western Australian Magpie. MSc Thesis, Univ. Western 
Australia (2016). 

32. Kaplan, G. Australian Magpie: Biology and Behaviour of an Unusual Songbird 
(CSIRO Publishing, 2004). 

33. Edwards, E. K., Mitchell, N. J. & Ridley, A. R. The impact of high temperatures 
on foraging behaviour and body condition in the Western Australian Magpie 
Cracticus tibicen dorsalis. Ostrich 86, 137-144 (2015). 

34. Morand-Ferron, J. Why learn? The adaptive value of associative learning in wild 
populations. Curr. Opin. Behav. Sci. 16, 73-79 (2017). 

35. Sherry, D. F. in Cognitive Ecology: the Evolutionary Ecology of Information 
Processing and Decision Making (ed. Dukas, R.) 261-296 (Univ. Chicago Press, 
1998). 

36. Lotem, A. & Halpern, J. Y. Coevolution of learning and data-acquisition 
mechanisms: a model for cognitive evolution. Phil. Trans. R. Soc. B 367, 
2686-2694 (2012). 

37. Leadbeater, E. What evolves in the evolution of social learning? J. Zool. 295, 
4-11 (2015). 

38. Amici, F., Aureli, F. & Call, J. Fission-fusion dynamics, behavioral flexibility, and 
inhibitory control in primates. Curr. Biol. 18, 1415-1419 (2008). 

39. Boogert, N. J., Giraldeau, L.-A. & Lefebvre, L. Song complexity correlates 
with learning ability in zebra finch males. Anim. Behav. 76, 1735-1741 
(2008). 

40. Rowe, C. & Healy, S. D. Measuring variation in cognition. Behav. Ecol. 25, 
1287-1292 (2014). 

Al. Griffin, A. S. & Guez, D. Innovative problem solving in nonhuman animals: the 
effects of group size revisited. Behav. Ecol. 26, 722-734 (2015). 

42. Nowicki, S., Searcy, W. & Peters, S. Brain development, song learning and mate 
choice in birds: a review and experimental test of the “nutritional stress 
hypothesis”. J. Comp. Physiol. A 188, 1003-1014 (2002). 

43. Burnham, K. P. & Anderson, D. R. Model Selection and Multimodel Inference: 

A Practical Information-Theoretic Approach (Springer, 2002). 

44. Symonds, M. R. E. & Moussalli, A. A brief guide to model selection, multimodel 
inference and model averaging in behavioural ecology using Akaike’s 
information criterion. Behav. Ecol. Sociobiol. 65, 13-21 (2011). 

45. Kembel, S. W. et a/. Picante: R tools for integrating phylogenies and ecology. 
Bioinformatics 26, 1463-1464 (2010). 

46. Nakagawa, S. & Schielzeth, H. Repeatability for Gaussian and non-Gaussian 
data: a practical guide for biologists. Biol. Rev. Camb. Philos. Soc. 85, 935-956 
(2010). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Extended Data Figure 1 | Adult cognitive test set. a-c, The cognitive test series used to quantify individual variation in inhibitory control (a), 
associative and reversal learning (b) and spatial memory (c). 
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Extended Data Figure 2 | Developmental trajectory of cognitive 
performance. a, b, The developmental trajectory of Australian magpies at 100, 
200 and 300 days after fledging for two cognitive traits: behavioural inhibition 
(a; n= 48 trials) and spatial memory (b; n = 46 trials). c, Developmental 
trajectory for behavioural inhibition and spatial memory combined 

(n=94 trials). Green dots, individuals from small groups (containing 1-7 
individuals); blue dots, individuals from large groups (>8 individuals). Scores 
are measured as either the number of trials taken to succeed at the task or the 
number of locations searched, so lower scores indicate better performance. 
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Extended Data Figure 3 | Frequency distribution of general cognitive 
performance in relation to group size. a, b, Frequency distribution of 
general cognitive performance among individuals in small groups 

(a; containing <8 individuals, n = 29 individuals) and large groups 

(b, >8 individuals, n = 17 individuals). 
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Extended Data Figure 4 | Juvenile cognitive test batteries. a—i, Cognitive unable to complete the associative and reversal learning tasks at 100 days 
test batteries presented to individuals at 100 (a—c), 200 (d-f) and 300 (g-i) after fledging. Red circles indicate that individuals had to search a different 
days after fledging, containing four tasks designed to quantify inhibitory location at each age tested in order to obtain the food reward in the spatial 
control (a, d, g), associative and reversal learning (b, e, h) and spatial memory task. 

memory (c, f, i). b is shown in black and white, because individuals were 
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Extended Data Figure 5 | Example of the lids used in the cognitive and swivelled when pecked, allowing individuals to search wells for their 
tasks. The lids used in the associative learning, reversal learning and contents. 
spatial memory tasks. The lids were held firmly in place by elastic bands, 
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Extended Data Table 1 | Principal component analysis (adults) 


Task PC1 

Inhibitory control 0.703 
Associative learning 0.789 
Reversal learning 0.870 
Spatial memory 0.841 
Eigenvalue 2.582 
% of total variance explained 64.56 


Results of the principal component analysis for adult magpies that completed all four tasks. 
All four tasks positively contributed to the first principal component extracted with an eigenvalue 
>1.n=46 individuals. 
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Extended Data Table 2 | Repeatability of cognitive performance 


Task Repeatability SE Confidence intervals P 
‘Inhibitorycontrol = ti(i‘é ONG: 49 (0.691, 0.882 <0.0001 ~ 
Associative learning 0.970 0.01 0.946, 0.983 <0.0001 
Reversal learning 0.975 0.008 0.954, 0.986 <0.0001 
Spatial memory 0.932 0.021 0.879, 0.963 <0.0001 


Estimations of repeatability for the first and second series of cognitive tests. Inhibitory control, n= 56 individuals; associative learning, n=46 individuals; reversal 
learning, n=46 individuals; spatial memory, n= 46 individuals. 
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Extended Data Table 3 | Principal component analysis (200 days 


after fledging) 
Task a al 
Inhibitory control 0.571 
Associative learning 0.916 
Reversal learning 0.941 
Spatial memory 0.737 
Eigenvalue 2993 
% of total variance explained 64.837 


Results of the principal component analysis for magpies that completed all four tasks at 200 days 
after fledging. All four tasks positively contributed to the first principal component extracted with 
an eigenvalue >1.n=15 individuals. 
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Extended Data Table 4 | Principal component analysis (300 days 


after fledging) 

Task PCI 

Inhibitory control 0.675 
Associative learning 0.947 
Reversal learning 0.972 
Spatial memory 0.957 
Eigenvalue 3.215 
% of total variance explained 80.363 


Results of the principal component analysis for magpies that completed all four tasks at 300 days 
after fledging. All four tasks positively contributed to the first principal component extracted with 
an eigenvalue >1. n= 10 individuals. 
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Posterior parietal cortex represents sensory history 
and mediates its effects on behaviour 


Athena Akrami!*, Charles D. Kopec!*, Mathew E. Diamond‘ & Carlos D. Brody? 


Many models of cognition and of neural computations posit 
the use and estimation of prior stimulus statistics!~*: it has long 
been known that working memory and perception are strongly 
impacted by previous sensory experience, even when that sensory 
history is not relevant to the current task at hand. Nevertheless, the 
neural mechanisms and regions of the brain that are necessary for 
computing and using such prior experience are unknown. Here we 
report that the posterior parietal cortex (PPC) is a critical locus 
for the representation and use of prior stimulus information. We 
trained rats in an auditory parametric working memory task, and 
found that they displayed substantial and readily quantifiable 
behavioural effects of sensory-stimulus history, similar to those 
observed in humans™® and monkeys’. Earlier proposals that the 
PPC supports working memory®” predict that optogenetic silencing 
of this region would impair behaviour in our working memory 
task. Contrary to this prediction, we found that silencing the PPC 
significantly improved performance. Quantitative analyses of 
behaviour revealed that this improvement was due to the selective 
reduction of the effects of prior sensory stimuli. Electrophysiological 
recordings showed that PPC neurons carried far more information 
about the sensory stimuli of previous trials than about the stimuli of 
the current trial. Furthermore, for a given rat, the more information 
about previous trial sensory history in the neural firing rates of the 
PPC, the greater the behavioural effect of sensory history, suggesting 
a tight link between behaviour and PPC representations of stimulus 
history. Our results indicate that the PPC is a central component in 
the processing of sensory-stimulus history, and could enable further 
neurobiological investigation of long-standing questions regarding 
how perception and working memory are affected by prior sensory 
information. 

Finding long-term regularities in the environment, and exploiting 
them, is a critical brain function in a complex yet structured world. 
However, little is known about the neural mechanisms involved in 
estimating these regularities or their impact on memory. The history 
of sensory stimuli affects working memory!®"! and many other tasks 
involving sensory percepts'*!3. One salient example, discovered over 
a century ago’ and repeatedly observed in human cognition®!*!, 
is contraction bias, in which the representation of a stimulus held in 
working memory shifts towards the centre of the distribution of stimuli 
observed in the past (the prior distribution). Despite the ubiquity of 
this phenomenon, and much psychophysical and theoretical research 
into the use and effects of prior stimulus distributions”, the neural 
mechanisms of contraction bias have not been identified. 

On the basis of previous work using somatosensory stimuli®, and 
inspired by parametric working memory (PWM) tasks in primates’, 
we developed a computerized protocol to train rats, in high-throughput 
facilities, to perform a novel auditory PWM task (behavioural shaping 
code at http://brodylab.org/auditory-pwm-task-code). PWM tasks 
involve the sequential comparison of two graded (that is, analogue) 
stimuli separated by a delay of a few seconds. Here we used auditory 
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Figure 1 | Rat performance and contraction bias. a, Rats compared two 
sequentially presented auditory stimuli s, and s,, separated by a delay, and were 
rewarded for correctly reporting, through their choice of inserting their nose 
into the left or right ports, which stimulus was louder. b, Set of (s,, s,) pairs 
used in a session. In each trial, one randomly selected pair was presented. The 
small purple squares represent stimuli used in a subset of sessions to assess 
performance at the psychometric threshold. c, Contraction bias proposes that 
the presented stimuli (black boxes) drive behaviour as if s, were closer (dashed 
boxes) to the average stimulus (s) (vertical midline) than its actual value. For 
some (sq, Sp) pairs, this decreases their difference, and thus impairs performance 
(bias, red), whereas for others it has the opposite effect (bias*, green). 
d, Overall average performance as a function of delay duration (n= 21 rats, 
mean +s.d. over subjects). e, Psychometric curves for one example rat (n = 120 
sessions, mean + s.e.m. over sessions, fits to a four-parameter logistic function; 
see Methods). f, Midpoint tangent line slopes for the psychometric curves for 
each of 12 rats. These are significantly greater (reflecting better performance) at 
delays of 2s compared with 6s (two sample t-test: 2 versus 4s: P=0.051; 2 versus 
6s: P=0.045; 4 versus 6s: P=0.86). The mean and s.d. of the 12 rats are shown 
in red; the results for rat W026 are shown in magenta. NS, not significant. 
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Figure 2 | Sensory history biases behaviour. a, Rat auditory trials. Left, 
percentage of trials in which rats went left minus the average value of left 
choices, as a function of the stimuli of the previous trial, for fixed previous 
trial response choice and reward. Right, percentage of trials in which rats 
went left for each combination of current and previous stimuli; vertical 
modulation indicates the previous-trial effect. b, As in a, but for human 
subjects in the auditory PWM task. c, As in a, but for human tactile trials. 
d, A linear weighted sum of 9 regressors is used to predict the probability 
ratio log(Pyo tett/Pgo right); Weights are fit to best-match training data, and are 
evaluated on left-out cross-validation data (see Methods). Regressors: 
averaged stimuli over the last 20-50 trials (excluding the last two); stimuli 
from last two trials; correct side on last trial (that is, the side baited with a 
reward; when b,' is positive, this increases the probability of going towards 
the previously baited side, that is, win-stay/lose-shift); current trial stimuli; 
overall side bias. e, Evaluation of model variants with different regressors 


pink noise stimuli, denoted s, and s,; rats were rewarded for correctly 
reporting which of the two was louder (Fig. 1a). Following ref. 16, the 
set of (Sa, sp) pairs used across trials in a session was chosen so that nei- 
ther stimulus alone contained sufficient information to solve the task 
(Fig. 1b). As with any magnitude-discrimination task, the smaller the 
difference between the stimuli, the harder the task (Fig. 1c). Classical 
contraction bias” argues that during the delay, the memory of the mag- 
nitude of s, drifts towards the mean of all stimuli presented (Fig. 1c, 
vertical line (s)). Consequently, for those pairs in which s, drifts away 
from the high-difficulty s,=s, diagonal, s, becomes more distinct from 
sp in memory, so contraction bias would improve performance (bias", 
Fig. 1c). In pairs for which s, drifts towards the diagonal, performance 
would decrease (bias). 

This predicted pattern is observed in the behaviour of the rats during 
our experiments (Fig. 1b, high percentage correct for bias* stimulus 
pairs (s,= 84, s, =92) and (s,=68, sp = 60), lower percentage correct 
for bias” stimulus pairs (s,= 60, sp = 68) and (s,= 92, sp = 84)). The 
same pattern has been observed in monkeys’ and humans (Extended 
Data Fig. 1d, e, and refs 5, 6, 17). History-dependent effects are prob- 
ably adaptive in the natural world, in which there are many long-term 
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Weight value 


(see Methods; Extended Data Fig. 6). The best model has the regressors shown 
in d. Model A has all regressors shown in d; in models B to E, the following 
regressors are progressively removed: long-term sensory history (B); 

stimuli of the last two trials (short-term history, C and D); previous trial 
correct side (E)—model E therefore has no history information. The next 
two models (F and G, shown in red) have the same regressors as A and B, 
respectively, but the weights on the s, of the current trial plus all previous 
sensory-stimuli weights are constrained to sum to 1, removing one free 
parameter. cv, cross validation. f, A poor match to the data in a is found from 
the predictions of a model with the current trial and previous trial’s correct 
side regressors only. g, As in f, but now including sensory-history regressors, 
which greatly improves the match to the data. h, Summaries of best-fit 
parameters over all subjects, from model F in e, with additional ‘lapse term. 
Black ticks, best-fit parameter values, per subject; grey bars, 95% confidence 
intervals. All panels sorted based on value of w?. 


regularities. But in our laboratory task, in which each trial is gener- 
ated independently, such biases, on average, produce suboptimal 
performance. The overall performance of the rats was robust and 
similar across delay intervals from 2-12 s (Fig. 1d; see Extended Data 
Fig. 1b, c for performance over learning). In some sessions, we used 
stimulus pairs that were closely spaced along s, (Fig. 1b) to measure 
the psychometric discrimination threshold. This worsened slightly, but 
significantly, at longer delay periods (Fig. le, f). 

Whereas variation of the delay interval resulted in small effects on 
overall performance (Fig. 1d), sensory history, by contrast, had a strong 
effect on performance. As quantified below, this effect was greater than 
the well-documented influence of previous rewards and choices!®-*° 
(see also Extended Data Fig. 1g-i). In the trials that followed a 
rewarded, rightwards choice—thus holding previous choice and reward 
fixed—it was found that the smaller the stimuli of the previous trial, 
the greater the percentage of leftwards choices in the current trial (Fig. 
2a, left; slope = —3.06% per decibel, linear fits to percentage leftwards 
minus average, P < 0.0001; see Extended Data Fig. 5a for slopes from 
n= 1-7 trials back). This is consistent with a contraction bias in which 
the estimate of (s) is weighted towards recent stimuli!’, because recent 
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small values make it more likely that the current s, is perceived as small, 
increasing the likelihood of an s, < sp (leftwards) response. Figure 2a, 
right shows that the same effect occurred across all combinations 
of current and previous trial stimuli from our standard stimulus set 
(|sq — Sp| fixed at 8 dB; see Extended Data Fig. 3 for n = 1-5 trials back, 
and Extended Data Fig. 4 for controlling for action and reward). Similar 
effects were found in human auditory (Fig. 2b) or tactile (Fig. 2c) 
versions of the task, and increased for larger delay intervals*!”? 
(Extended Data Fig. 2). To simultaneously take into account effects 
across several previous trials of the history of rewards, choices and 
stimuli, we fit logistic regression models with these variables as regres- 
sors, and compared the performance of a variety of such models 
on cross-validation data (Fig. 2d, Methods, Extended Data Fig. 6). 
Consistent with human data!’, short-term (previous two trials) sen- 
sory history had a strong effect on behaviour. In addition, our large 
dataset revealed a smaller but nevertheless important effect of longer- 
term (average of previous few tens of trials) sensory history (Fig. 2e, 
Extended Data Fig. 5b). It has been suggested*!?*-*” that sensory his- 
tory does not add a behavioural bias independent of working memory, 
but instead produces a value of s, in working memory that is a weighted 
average of the current stimulus and sensory history. In weighted aver- 
ages, the weights sum to one. Consistent with that suggestion, we found 
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Figure 3 | The PPC is specifically necessary for the behavioural effect 

of previous sensory stimuli. a, Schematic of virus injection and full 

trial inactivations (delay =2 s). The atlas panel is taken from ref. 31. 

AP, anteroposterior; ML, mediolateral. b, Left, the stimuli included our 
standard (s,, sp) set (black) plus psychometric pairs (purple); right, the 
ideal psychometric performance. c, Psychometric curves for all rats were 
closer to ideal during PPC inactivation (yellow) trials than during control 
(grey) trials. Far right, sham inactivation in rats with optic fibres but 
without virus had no effect (n =2). Error bars show s.e.m. d, Percentage 
correct averaged across all bias* or all bias~ trials (Fig. 1c), relative to 
overall average performance. PPC inactivation eliminates the difference 
between bias and bias” trials, or versus overall average (t-test, bias* 

— bias” significantly different from zero, laser off, P< 0.00001; laser on, 
P=0.706; laser off versus on, P < 0.01; two-way analysis of variance, 
interaction of laser on/off and biast/bias~: P< 0.01). Error bars show 
s.e.m. e, The bias induced by previous stimuli is reduced under PPC 
inactivation (laser off: slope of —4.74, P=0.0017; laser on: slope of —1.36, 
P=0.42; laser on versus off: P= 0.044). f, Left, PPC inactivation selectively 
reduces sensory-history weights in the regression model shown in Fig. 

2d. Error bars show 95% confidence interval (n = 600, 200 iterations of 
threefold cross-validation; *P < 0.01, one-sided t-test). Right, sum of all 
other weights. See Extended Data Fig. 9 for individual weights. g, Reducing 
sensory-history weights in the model is sufficient to improve psychometric 
performance, comparable to experimental data (c). h, Similar to c, for 
inactivation during either s, (left), delay (middle) or sy (right). The insets 
show the sum of all sensory-history regressors, as in f (n = 600). Only 
inactivation of the PPC during the delay produces a significant effect 
(permutation test, s,, laser off versus on: P= 0.17; delay: P< 0.0001; 

sp: P= 0.18; s, versus delay: P= 0.03, sp versus delay: P= 0.02). 


that constraining the sum of regression weights on the s, stimulus of the 
current trial plus weights on previous sensory stimuli to equal one— 
thus removing one free parameter from the model—produced the best 
performance on cross-validation data (Fig. 2e, red; see Extended Data 
Fig. 6 for all models and comparisons; the best-fitting model, for every 
individual rat, had the regressors in Fig. 2d). Sensory history was essen- 
tial in accounting for behaviour (Fig. 2f, g): examining the weights in 
the regression model (Fig. 2h) shows that those for sensory history are 
significantly larger than those for correct side history (P < 0.001), and 
have a greater impact on behaviour (Extended Data Fig. 6g). 

It has been proposed that the PPC is critical for working memory 
(refs 8, 9, 28, but see also refs 29, 30), and therefore we examined its role 
in our task. We injected bilaterally an AAV virus that drives expression 
of the light-activated inhibitory opsin halorhodopsin eNpHR3.0, under 
the CaMKII promoter (centre of injection, anteroposterior —3.8mm 
and mediolateral 2.5 mm from the bregma, Fig. 3a, Extended Data 
Fig. 7a). Sharp optical fibres were inserted at the centres of injection 
sites to deliver laser illumination, and we inactivated the PPC during a 
randomly chosen 20% of trials. To best probe for any small effects, we 
included psychometric stimuli (Figs 1b and 3b). 

Expecting a performance impairment’, we were surprised to instead 
observe an improvement in psychometric performance in all rats tested 
(Fig. 3c). However, the effect was not simply an overall performance 
improvement: looking beyond the psychometric stimuli, although 
performance was indeed improved with respect to control on bias~ 
trials, PPC silencing instead impaired performance on bias‘ trials 
(Fig. 3d). Moreover, the difference between bias* and bias” trials was 
eliminated, as was their difference from control average performance 
(Fig. 3d, Extended Data Fig. 9a). Similarly, bias as a function of the 
stimuli of the previous trial was markedly reduced (Fig. 3e, laser 
off: P=0.42; laser on: P=0.0017; laser on versus off: P= 0.044, see 
Extended Data Fig. 8 for effect on history matrices). Fitting our regres- 
sion model separately to the set of laser on versus off trials, we found 
that PPC inactivation significantly reduced sensory-history regression 
weights (Fig. 3f), and no other regression terms were significantly 
affected (individual weights in Extended Data Fig. 9). A model with 
reduced sensory-history effects as in Fig. 3f was sufficient to reproduce 
the experimental data (Fig. 3g). Therefore, PPC silencing appeared to 
have no impact on working memory, but specifically and substantially 
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significantly larger than the shuffled distribution (P < 0.005) are included; 
non-significant values are dark blue. To control for reward and choice, mutual 
information values were calculated using only trials with fixed choice and 
reward (see Methods). c, Summary of population analysis, showing the 
percentage of cells with significant coding of stimuli from trial i (red), or trial 
i— 1 (blue). d, Percentage of cells with significant mutual information about 
the previous trial, for each of five recorded rats, calculated over 5.3s during 
the ITI before a new trial. The horizontal dashed line indicates the percentage 
of cells expected by chance from shuffled data (see Methods). e, As in d; data 
from 5.3s of the new trial. f, Behavioural bias for individual rats, calculated as 
relative sensory-history weights (to weights for s, and sp regressors of trial i) 
from the best model fit. g, Neuron-behaviour correlation; data from f plotted 
against data from e, Spearman's rank correlation r= 1.0, P< 0.01, n=5 rats. 
h, Calculation as in d-g, using 2-s windows. High neuron-behaviour 
correlation appears concurrently with the s, of the new trial. Error bars show 
s.d. of the mean over n = 1,000 bootstrap samples with replacement. 


reduced sensory-history effects. This reduction did not persist into 
future trials (Extended Data Fig. 8), and occurred with PPC inhibition 
during the working memory delay, but not during s, or s, (Fig. 3h), 
suggesting a focused role for the PPC in the interaction between the 
previous stimuli and the working memory of the current trial. 

To examine whether signatures of sensory history are present in 
the region in which inactivation appears to cancel history effects, we 
made extracellular recordings that targeted the PPC during task per- 
formance, recording from 936 units in 5 rats implanted with microwire 
arrays. Neurons with a mean firing rate of below 2 Hz were discarded 
(Methods), leaving 361 units for the analysis. Most cells gave results 
similar to those of the example cell in Fig. 4a; their firing rates during 
the working memory delay did not distinguish between values of s, 
held in memory, and therefore did not carry information about them. 
Instead, robust information about the sensory stimulus pair appeared 
approximately 1s after the trial had terminated, during the inter-trial 
interval (ITI). We used mutual information (Methods) to quantify the 
amount of information, in neuronal firing rates, about which (sa, sp) 
sensory-stimulus pair was presented (Fig. 4b, c; see Extended Data 
Fig. 10 for mutual information about other components, including 
current and previous choices, rewards and s, alone). During the ITI 
before a new trial, a large fraction of PPC neurons carried significant 
information about the previous trial stimuli (22% of analysed neurons; 
Fig. 4c). A smaller fraction of cells continued to code the stimuli of 
the previous trial both into the start of the new trial and throughout 
the working memory delay of the new trial (Fig. 4c, Extended Data 
Fig. 10b). We computed the fraction of neurons with significant mutual 
information about the stimuli of the previous trial, during the ITI 
(Fig. 4d) or the current trial (Fig. 4e), and compared this to the 
strength of the sensory history behavioural bias of the rat (Fig. 4f). 
During the new trial, but not during the ITI, these two measures 
were perfectly correlated (Fig. 4g, Spearman’s rank correlation r= 1, 
P<0.01 during current trial; r=0.3, P=0.68 during ITI; P< 0.00001 
ITI versus full current trial from Steiger’s Z-test). This suggests two 
things: first, a tight link between sensory history representations in the 
PPC and history biases, and second, that the PPC history representa- 
tion is used during or shortly after the presentation of the new s, 
(Fig. 4h), consistent with the idea that contraction bias affects the 
working memory representation of s,. 

Parametric working memory tasks, with their quantifiable behav- 
iour, are well suited to investigating the effect of sensory history 
on perception and behaviour. Rodent versions of these tasks, with 
semi-automated training, are an efficient platform for causal and 
cellular-resolution investigation of neural mechanisms. By using 
this platform, we identified the PPC as an essential node in both the 
representation and causal effects of sensory-stimulus history. This 
represents a step towards a cellular-resolution understanding of 
long-standing questions about how sensory-stimulus history affects 
working memory and perception. Important issues that may now be 
addressed include how history representations in the PPC interact with 
current stimulus representations to modulate perception, how history 
information reaches the PPC, and which brain regions connected to 
the PPC are also essential nodes of the circuit. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Rat subjects. A total of 33 male Long-Evans rats (Rattus norvegicus) between the 
ages of 6 and 24 months were used for this study. Of these, 25 rats were used for 
behavioural assessments, 6 were used for neural recordings, and 7 for optogenetic 
inactivations. All rats were assigned randomly to different experimental conditions. 
All statistical tests were performed between groups with similar sample sizes. No 
statistical methods were used to predetermine sample size. Investigators were not 
blinded to experimental groups during data collection or analysis. Animal use 
procedures were approved by the Princeton University Institutional Animal Care 
and Use Committee and carried out in accordance with National Institutes of 
Health standards. 

Human subjects (auditory). 11 human subjects (8 males and 3 females, aged 
22-40) were tested and all gave their informed consent. Participants were paid to be 
part of the study and were naive to the main conclusions of the study. The consent 
procedure and the rest of the protocol were approved by the Princeton University 
Institutional Review Board. 

Human subjects (tactile). 14 human subjects (8 males and 6 females, aged 22-35) 
were tested. Protocols conformed to international norms and were approved by 
the Ethics Committee of the International School for Advanced Studies (Trieste, 
Italy). Subjects signed informed consent. 

Rat behaviour. We developed a computerized protocol to train rats, in 
high-throughput facilities, to perform an auditory delayed comparison task, 
adapted from a tactile version’. All training takes place in three-port operant con- 
ditioning chambers, in which ports are arranged side-by-side along one wall, with 
two speakers placed above the left and right nose ports. Figure 1a shows the task 
structure. A visible light-emitting diode in the centre port signals the availability 
of each trial. Rat subjects initiate a trial by inserting their nose into the centre port, 
which causes the centre light to turn off. Rats must keep their nose in the centre 
port (fixation period), until an auditory ‘go’ cue (a 6-kHz pure tone for 200 ms) 
signals the end of fixation. Only after the ‘go’ cue can subjects withdraw and orient 
to one of the side ports in order to receive a reward of water. During the fixation 
period two auditory stimuli, s, and sp, separated by a variable delay, are played for 
400 ms, with short delay periods of 250 ms inserted before s, and after sp. The stim- 
uli consist of broadband noise (2,000-20,000 Hz), generated as a series of sound 
pressure level (SPL) values sampled from a zero-mean normal distribution. The 
overall mean intensity of sounds varies from 60-92 dB. Rats should judge which 
out of the two stimuli, s, and sp, had the greater SPL standard deviation. If s, > sp, 
the correct action is to poke the nose into the right-hand nose port in order to 
collect the reward, and if s, < sp, rats should orient to the left-hand nose port. Trial 
durations are independently varied on a trial-by-trial basis, by varying the delay 
interval between the two stimuli, which can be as short as 2s or as long as 12s. Rats 
progressed through a series of shaping stages before the final version of the delayed 
comparison task, in which they learned to: associate light in the centre poke with 
the availability of trials; associate special sounds from the side pokes with reward; 
maintain their nose in the centre poke until they hear an auditory ‘go’ signal; and 
compare the two s, and s, stimuli. 

Although a substantial amount of data has been collected on all delay intervals 
from 2 to 12s, in this manuscript we focus on delay durations of 2, 4 and 6s, as 
most of the rats were consistently trained on these values. Training began when 
rats were two months old, and typically required three to four months for rats to 
display stable performance on the complete version of the task. 

Human auditory behaviour. Similar auditory stimuli to those used for rats were 
used in the human version of the task. In this experiment, subjects received, in each 
trial, a pair of sounds played from ear-surrounding noise-cancelling headphones 
(brand 233621-H501). The subject self-initiated each trial by pressing the space bar 
on the keyboard. The first sound was then presented together with a green square 
on the left side of a computer monitor in front of the subject. This was followed 
by a delay period, indicated by “WAIT! on the screen, then the second sound was 
presented together with a red square on the right side of the screen. At the end of 
the second stimulus and after the go cue, subjects were required to compare the 
two sounds and decide which one was louder, then indicate their choice by pressing 
the ‘k key with their right hand (second was louder) or the ‘s’ key with their left 
hand (first was louder). Written feedback about the correctness of their response 
was provided on the screen, for individual trials as well as the average performance 
updated every ten trials. 

Human tactile behaviour. In a separate set of experiments, run at the International 
School For Advanced Studies (SISSA), human subjects performed the tactile ver- 
sion of the task. The details of this task have been previously described and the 
behaviour has been characterized°. In brief, at each trial two noisy vibration stimuli, 
interleaved with a variable delay interval, were delivered to the subject’s fingertip 
on their left hand. Subjects viewed a computer monitor and wore headphones that 
presented acoustic noise and eliminated ambient sounds. To start a trial, the subject 
pressed the keyboard up arrow with their right hand. This triggered presentation 


of the two stimuli. After a post-stimulus delay, a blue panel was illuminated on the 
monitor, and the subject pressed the left or right arrow on the keyboard, signifying 
selection of the first or the second stimulus as greater, respectively. They received 
feedback (correct/incorrect) on each trial via the monitor. Human experiments 
were controlled using LabVIEW software (National Instruments). 

Stimulus set. If the first stimulus, s,, was fixed across all trials and only the second 
stimulus, sp, changed, subjects might solve the task by ignoring the first stimulus 
and applying a constant threshold to the second stimulus. Likewise, if the second 
stimulus was fixed, subjects might apply a constant threshold on the first stimulus. 
To prevent such alternative strategies, it is necessary to vary both s, and sp, and use 
a set of stimuli composed of pairs of s, and sp that guarantee that, across trials, the 
same value of SPL standard deviation is randomly presented for the first stimulus 
or the second stimulus. The stimuli along the diagonal in Fig. 1b represent such 
a stimulus set. A minimum of eight pairs of stimuli span a wide range of SPL standard 
deviation values (Fig. 1b). Using this stimulus set, if the subject were to ignore 
either s, or sp, then the maximum performance would be 63%. The mean ampli- 
tudes of stimuli were evenly distributed in a logarithmic scale (linear in dB). The 
diagonal line represents s, = sp; all of the stimulus pairs on one side of the diagonal 
are associated with the same action, and all have the same ratio of s, to sy. For each 
trial, one of these eight pairs of stimuli is randomly selected to determine s, and sp. 
Psychometric curves. In some sessions, we used stimulus pairs that were closely 
spaced along s, (Fig. 1b) to measure the psychometric discrimination threshold. 
Psychometric plots (as shown in Fig. le and Fig. 3c) show the probability of the 
subject responding leftward as a function of the difference between s, and s, when 
Sp is fixed. The fits were to a four-parameter logistic function of the form 


a 
“?) 


y(%) = Yo + 
1+ el b 


where yo is the left endpoint, yo + ais the right endpoint, xo is the bias, and a/4b is 
the slope. Fits were performed using the nonlinear least square regression method 
(nlinfit.m function) in MATLAB 2013. 

Regression model of behaviour. Our semi-automated training protocol 
facilitated the generation of a behavioural dataset comprising 468,165 trials 
from 25 trained rats, which in turn enabled statistical characterization of the 
decision-making process. To quantify the behaviour of the rats, we carried out an 
analysis to weight the contributions of s, and s, on the current trial and several 
trials in the past, as well as the contributions of the history of choice and reward 
on the rat’s choice in the current trial. Using the data generated by concatenating 
several training sessions, we fit the rat’s choice with a logistic regression model 
that allows for the linear combinations of s, and s, and other desired factors. 
The linear combination is then mapped non-linearly into the rat’s choice; that 
is, the probability of trials in which the subject judged s, > sp, through a logistic 
function as: 


1 — 2lapse 


P(left) = lapse + ero 


where 


n 
A=)>(s,'W,'+5,'W,' +bg +8) 


t=0 


where W,‘ and W;,’ are coefficients of the s, and s, regressors, respectively, from 
t trials back. ba is the correct side on the previous trial: left = +1, right = —1. This 
regressor captures the win-stay/lose-switch strategy. /3 is the baseline regressor that 
captures the overall (stimulus-independent) bias of the subject in calling s, > sp 
(for instance, a bias against turning right, the side associated with the judgment of 
Sq > Sp). The absolute values of all of the regressors are normalized between 0 and 
1. We used the log-likelihood as the cost function C: 


Cc y log(p) — (1— y)log(1 — p) . pie? log(p;) + (1 — y,)log(1 — Pi) 


The model was fit using a gradient-descent algorithm to minimize the negative 
log-likelihood cost function. We used the sqp algorithm in the fmincon function 
from MATLAB 2013. Weights were calculated using L2-regularization to prevent 
overfitting. The hyperparameter value (A) was selected independently for each rat 
using evidence optimization, on the basis of fivefold cross-validation. Different 
variants of the model, which systematically study the relevance of various sensory 
and reward history factors!*!”?, capturing not only win-stay/lose-switch but also 
perseveration, are discussed in Extended Data Fig. 6. 

Model comparison and cross-validation. All models were fit separately for each 
individual rat (n= 25), using 200 runs of fivefold cross-validation. For each run 
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we calculated the log-likelihood of the test dataset given the best-fit parameters 
on the training set (log/). We also calculated the log-likelihood of the test dataset 
for the mean value of %Left (the experimentally measured fraction of trials in 
which the rat went left). This gives us a null log-likelihood reference value (loglo). 
In order to quantify the efficiency of each model we defined the cross-validated 
bit/trial (CV-bit/trial) as the trial-averaged excess likelihood of the model com- 
pared to the null model*: 


(log 1 — log Io) /Mtials 
log(2) 


For each model, we first chose the optimal regularization value (\) that would 
maximize the CV-bit/trial. To compare different models, we calculated the 
median value of CV-bit/trial across 10,000 fits for each subject. Because in this 
method we measure the log-likelihood using the cross-validated data, it auto- 
matically addresses the overfitting problem, such that if additional parameters 
of one model result in overfitting in the training set, it would penalize it in the 
cross-validated test set. 

Optogenetic virus injection and fibre implantation. For optogenetic perturba- 
tion experiments, the general surgery techniques and fibre etching follow previ- 
ous reports*, except that we began the construction with a standard off-the-shelf 
50/125 jum LC-LC duplex fibre cable (http://www.fibercables.com), instead of the 
usual FC-FC duplex fibre cables. The cable jacket, strengthening fibres and outer 
plastic coating (typically white or orange) were fully removed, leaving 1 cm of the 
fibre-optic cable and the inner plastic coating (typically clear) intact. Then 2mm 
of the fibre tip (with the final layer of plastic coating still attached) was submerged 
in 48% hydrofluoric acid topped with mineral oil for 85 min, followed by water 
for 5min (submerging 5mm), and acetone for 2 min to soften the plastic. The 
plastic coating was then gently cut with a razor and removed with tweezers to 
reveal a 1-mm sharp-etched fibre tip. Sufficient plastic was removed, depending 
on the depth of the targeted site, to ensure that only the glass fibre optic would be 
inserted into the brain. 

For viral injections, 2 jl of adeno-associated virus (AAV) (AAV5-CaMKHa 
-eNpHR3.0-eYFP), which drives expression of the light-activated inhibitory opsin 
halorhodopsin eNpHR3.0, under the CaMKIIa promoter, coupled to eYFP, was 
lightly dyed with fast green powder and front-loaded into a glass pipette mounted 
to a Nanoject (Drummond Scientific) prefilled with mineral oil. The pipette tip 
was manually cut to a diameter of approximately 301m. Five closely spaced injec- 
tion tracts were used for each rat. For the central injection tract (anteroposterior 
—3.8mm and mediolateral 2.5mm from the bregma, Fig. 3a; brain image from 
Paxinos and Watson*!), one injection of 23 nl was administered every 100|1m in 
depth, starting 100|.m below the brain surface of the PPC, to a total depth of 
1.5mm. Four additional injection tracts were completed, using procedures iden- 
tical to those for the central tract, with one injection at 500 1m anterior, posterior, 
medial and lateral from the central tract. Each injection was followed by a 10-s 
pause, with 1 min after the final injection in a tract before the pipette was removed. 
A total of 1.51] of virus was injected over a 30-min period consisting of approx- 
imately 160 separate injections. A chemically sharpened fibre optic (501m core, 
125 1m cladding) was then lowered down the central injection tract to a depth of 
1mm. The craniotomy was filled with kwik-sil (World Precision Instruments), 
allowed to set for 10 min, and the fibre optic was secured to the skull with C&B 
Metabond and dental acrylic. Dental acrylic covered the entire incision site and 
allowed only the LC connector to protrude. Halorhodopsin expression was allowed 
to develop for six weeks before the behavioural testing began. 

Optogenetic perturbation. The implant in the rat was connected to a 1-m patch 
cable attached to a single-fibre rotary joint (Princetel) mounted on the ceiling of 
the behavioural chamber. This was connected to a 200 mW, 532 nm laser (OEM 
Laser Systems) operating at 25 mW, which was triggered with a 5 V transistor- 
transistor logic pulse. Laser illumination occurred on 20% of randomly selected 
trials. See Extended Data Fig. 7 for physiological confirmation of optogenetic 
inactivation effects in an anesthetized rat. On the basis of our previous quantifi- 
cations of optogenetic effects**, we estimate that using eNpHR3.0 we can inhibit, 
almost entirely, neurons in a radius of approximately 750|1m from the tip of the 
optic fibre, amounting to a sphere of around 1,500,.m in diameter. 

Recordings. Six rats were implanted with microwire arrays in their left or right 
PPC (n=3 in IPPC, n=3 in rPPC, see Extended Data Fig. 7 for histological local- 
ization of electrodes). The target region was accessed by craniotomy, using stand- 
ard stereotaxic techniques (centred 3.8 mm posterior to the bregma and 2.5mm 
lateral to the midline). Dura mater was removed over the entire craniotomy with a 
small syringe needle. The remaining pia mater, even if not usually considered to be 
resistant to penetration, nevertheless presents a barrier to the entry of the micro- 
electrode arrays, owing to the high-density arrangement of electrodes in the mul- 
ti-channel electrode arrays. This dimpling phenomenon, when the electrodes are 
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pushing the brain cortex down without penetrating, is more pronounced for arrays 
with larger numbers of electrodes. In addition to potentially injuring the brain 
tissue, dimpling is a source of error in the determination of depth measurements. 
Ideally, if dimpling could be eliminated, the electrodes would move in relation 
to the pial surface, allowing for effective and accurate electrode placement. To 
overcome the dimpling problem, we implemented the following procedure. 
After the craniotomy was made, and the dura was carefully removed over the 
entire craniotomy, a petroleum-based ointment (such as bacitracin ointment or 
sterile petroleum jelly (Puralube Vet Ointment)) was applied to the exact site of 
electrode implantation. The cyanoacrylate adhesive (Vetbond Tissue Adhesive) 
was then applied to the zone of the pia surrounding the penetration area. This 
procedure fastens the pia mater to the overlying bone and the resulting surface 
tension prevents the brain from compressing under the advancing electrodes. 
Once the polymerization of cyanoacrylate adhesive was complete, over a period 
of few minutes, the petroleum ointment at the target site was removed, and 
the 32-electrode microwire array (Tucker-Davis Technologies) was inserted by 
slowly advancing a Narishige hydraulic micromanipulator. After inserting the 
array(s), the remaining exposed cortex was covered with biocompatible silicone 
(kwik-sil), and the microwire array was secured to the skull with C&B Metabond 
and dental acrylic. 

During the ten-day recovery period, rats had unlimited access to water and food. 

Recording sessions in the apparatus began thereafter, using Neuralynx acquisition 
systems. Extracellular activity of the PPC neurons was manually sorted into single 
units and multi units, on the basis of the spike waveform and the refractory period 
observed in the interspike interval histogram, using SpikeSort3D software. In total 
1,081 single or multi units were recorded in the PPC of 6 rats. Only neurons for 
which the overall mean firing rate within the session was at least 2 Hz were included 
in the analysis, giving a total of 456. 
Neural analysis. Mutual information. To quantify the type and amount of infor- 
mation that the PPC neurons carry about various task parameters, we computed 
Shannon’s mutual information*>. In this formulation, the amount of information 
that can be extracted from the firing rate of a neuron R, about the task-related 
parameter X can be computed as: 


P(r|x) 
P(r) 


1(X, R) = Sy: P(x) > P(r, x)log, 


where P(r|x) is the conditional probability of observing a neuronal response 
r given the presentation of the task parameter x, P(r) is the marginal probability 
of occurrence of neuronal response r among all possible responses, and P(x) is 
the probability of task parameter x. Information measured in this way quantifies 
how well an ideal observer can discriminate between members of a stimulus set 
on the basis of the neuronal responses of a single trial®°, For each trial, neuronal 
response was defined as the rate of spiking during a time window of 100 ms. The 
conditional probability in the above formula is not known a priori and must be 
estimated empirically from a limited number, N, of experimental trials for each 
stimulus. Limited sampling of response probabilities can lead to an upward bias 
in the estimate of mutual information’. In order to correct for this bias, we used 
a combination of two techniques. First we estimated and corrected the bias based 
on the quadratic extrapolation method**, which assumes that the bias can be accu- 
rately approximated as second order expansions in 1/N. Then we used a bootstrap 
procedure (shuffling) that consists of many rounds of pairing stimuli and responses 
at random in order to destroy all the information that the responses carry about 
the stimulus. Owing to limited data sampling, the information computed using 
the bootstrapped responses may still be positive. The average value of the boot- 
strapped information was then used to estimate the residual bias of the information 
calculation, and was subtracted out. Moreover, the distribution of bootstrapped 
information values was used to build a non-parametric test of whether the cor- 
rected information computed using quadratic extrapolation method is significantly 
different from zero”’. 

Using the mutual information distribution from a shuffled dataset, at each 
time bin, in which trials are randomly labelled, we first calculated the bin-by-bin 
estimate of the percentage of cells with significant value of mutual information 
expected by chance (Fig. 4c, shuffled data). We then computed the average over 
the ITI or the duration of trial i to find the mean values depicted by dashed lines 
in Fig. 4d, e. To control for reward and choice, mutual information values were 
calculated using only trials with fixed choice and reward, and only then averaged 
across different, separately calculated reward and choice groups. 

Code availability. All software used for behavioural training is available on the Brody 
laboratory website at http://brodylab.org/auditory-pwm-task-code. Software used for 
data analysis is available from the corresponding authors upon reasonable request. 
Data availability. Raw and processed data are available from the corresponding 
authors upon reasonable request. 
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Extended Data Figure 1 | Full stimulus set, learning curves, mean 
performance and reward bias. a, Each stimulus is composed of a series of 
SPL values sampled from a zero-mean normal distribution, and standard 
deviation of s. For each trial, SPL values are randomly drawn and therefore, 
owing to sampling statistics, the actual standard deviation value of the 
stimulus always differed slightly from its designated value. The coordinates 
of each small box represent the actual joint values of (s,, s,) for one sample 
training session. b, Individual grey lines show learning curves presented 

as the change in percentage correct over months of training, for n = 25 
rats. An average rat (black line) reaches 70% of performance after 90 
sessions. c, Learning curve presented as the ratio of the best fit weights for 
the second stimulus, s,, to the first stimulus, s,, using the model described 
in Fig. 2e (three-parameter, no-history version). d, Rat auditory working 
memory performance, data from 21 rat subjects (total of 468,165 trials) 

are grouped according to (sq, sp) pair but averaged across subjects and over 
different delay durations (2-8 s). e, Human auditory working memory 


performance. For humans, the interstimulus delay varied randomly from 

2s to 6s. (11 subjects, 12,623 trials). f, Human tactile working memory 
performance; similar to e but for humans engaged in the tactile version of 
the task. In this task, the interstimulus delay varied randomly from 2s to 
8s. Data from 14 human subjects (total of 4,694 trials) are pooled together. 
g, Reward history bias. Left, the y axis shows, for turn-left trials and as a 
function of k, the percentage of subjects that went left when the k* trial 
back was rewarded on the left, minus the percentage that went left when 
the k' trial back was rewarded on the right. Right, the complementary 
plot for turn-right trials: the percentage that went right when the k"" trial 
back was rewarded on the right, minus the percentage that went right 
when the k'* trial back was rewarded on the left. Data from n = 21 rats. 
Each point shows the mean value of the bias over subjects. Error bars show 
95% confidence intervals. h, i, Similar to g for human auditory (h, n= 11 
subjects) and tactile (i, n = 14 subjects) PWM tasks. 
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Extended Data Figure 2 | Contraction bias grows as a function of the 
working memory delay interval of the current trial. a, Slopes from linear 
fits to the percentage leftward bias (as in Fig. 2a), for rats that were each 
trained on delay intervals of 2, 4, and 6s (n=21). The plot on the left 
shows the behavioural bias (percentage that went left minus the average) as 
a function of working memory delay interval of the current trial. The plot 
on the right shows the behavioural bias as a function of working memory 
delay interval from one trial back. Each dot represents a rat; lines connect 
the different delay intervals for each rat. Left: from a one-sided paired 
t-test, 2 versus 4s: P=0.012, 2 versus 6s: P< 0.001; 4 versus 6s: P< 0.001 
*P < 0.001, one-sided paired t-test. Right: 2 versus 4s: P=0.76, 2 versus 
6s: P=0.37; 4 versus 6s: P=0.65. The behavioural bias increases 

with greater current working memory delay period, but no significant 
dependence on the working memory delay period of the previous trial is 
found”*. b, Percentage correct averaged across all bias* trials or all bias” 
trials, relative to overall average performance, as a function of working 
memory delay interval on the current trial. Data are pooled from a dataset 
in which different rats were trained on different sets of delay intervals; 
data for each delay interval may therefore contain different rats than data 
for other delay intervals (n= 25 rats total). Error bars show s.d. As ina, 
behavioural effect grows as a function of the current working memory 


delay period. *P < 0.001, one-sided t-test. 

c, Schematics of stimuli used for three different psychometric curves: 
high s,, in which contraction bias would lead all the s, stimuli to be treated 
as lower than they actually were (indicated by the leftward arrows), 
producing a rightward shift of the psychometric curve; mid s,, in which 
contraction bias would lead all the s, stimuli to be treated as closer to s, 
than they actually were, producing a flattening of the psychometric curve; 
and low sp, in which contraction bias would lead all the s, stimuli to be 
treated as higher than they actually were, producing a leftward shift of the 
psychometric curve. d, Psychometric curves for low-sp trials, averaged 
across rats and separately for each individual rat, for trials with a 2-s 
working memory delay interval, and for trials with a 6-s working memory 
delay interval. Curves are fits to a four-parameter logistic function 

(see Methods). As the working memory delay interval grows, the leftward 
shift predicted by contraction bias shift is more pronounced. For each 
individual rat, nm = 120 sessions of data were used. Error bars show the 
s.e.m. over sessions. e, as in d but for the mid-s, trials. As the working 
memory delay interval grows, the flattening predicted by contraction bias 
is more pronounced. f, as in d but for the high-s, trials. As the working 
memory delay interval grows, the rightward shift predicted by contraction 
bias is more pronounced. 
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Extended Data Figure 3 | Sensory-history matrix, from one to five trials 
back. a, Stimulus-history matrix, as described in Fig. 2a, when percentage 
left is shown given any combination of the stimuli in the current trial 

(x axis) and n-trials back (y axis), n= 1, 2, 3, 4, 5. Trial numbers indicate 
pairs of (sa, sp), values in dB. 1: (68, 60); 2: (76, 68); 3: (84, 76); 4: (92, 84); 
5: (60, 68); 6: (68, 76); 7: (76, 84); 8: (92, 84). Data from n= 21 rats, 
comprising a total of 468,165 trials used in this analysis. b, Similar to a, 
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4 trials back 5 trials back 


Rat, N = 21 


Human, Auditory 
N=11 


Human, Tactile 
N=14 


14 14 1 14 

for the human auditory task. Trial numbers, with values in dB: 1: (62.7, 
60); 2: (65.4, 62.7); 3: (68.1, 65.4); 4: (70.8, 68.1); 5: (73.5, 70.8); 6: (60, 
62.7); 7: (62.7, 65.4); 8: (65.4, 68.1); 9: (68.1, 70.8); 10: (70.8, 73.5). 

c, Similar to a, for the human tactile task. Trial numbers, in mm s™!: 

1: (33, 23); 2: (46, 33); 3: (64, 46); 4: (90, 64); 5: (125, 90); 6: (175, 125); 

7: (245, 175); 8: (23, 33); 9: (33, 46); 10: (46, 64); 11: (64, 90); 12: (90, 125); 
13: (125, 175); 14: (245, 175). 
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Extended Data Figure 4 | Sensory-history matrix, controlled for reward 
and choice. Similar to Extended Data Fig. 2, except that in this plot only 
trials for which the previous trial resulted in the same action and reward 


status are included. Therefore, modulation by previous trial cannot be due 


to action history or reward history. Trial numbers are similar to those in 
Extended Data Fig. 3. 
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Extended Data Figure 5 | Short-term and long-term sensory history, 
and estimating the optimal window of (s). a, Slopes from linear fits to the 
percentage leftward bias from n-back trials (n = 1-7, as in Fig. 2a where 
n= 1 was used), and also (s) which is a window of 17 trials, from n= 4 to 
n= 20, in grey. Each point shows the mean of the slope values over n= 25 
rats. Error bars show 95% confidence intervals. b, For each rat the optimal 
exponential window over the past trials was estimated such that it would 
maximize the cross-validation bit/trial measurement. Two models are 
compared here: green shows the distribution of 7 values from a model that 
has five regressors to account for the sensory history—the first and second 


Single exponential model 
of sensory history 


Regressors: <s> 


stimulus from the two trials back and a separate exponential window over 
the remaining past trials (Fig. 2d). The results shown in orange are from a 
model containing only one regressor: a single exponential window over all 
the past trials accounts for the sensory history. In the single-exponential 
model, the best-fit value of 7 is very small, practically as if only past one 
or two trials back are inducing most of the effect. c, The five-parameter 
model of sensory history outperforms the single-exponential model. Two 
hundred iterations of fivefold cross validation were used to calculate the 
cross-validated bit/trial (see Methods). Accordingly, each bar shows the 
mean of m= 1,000 data points. Error bars denote s.d. 
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Extended Data Figure 6 | Model comparison. a, Model comparisons, 

200 runs of fivefold cross validation were performed, on data from each 
rat, in order to find the best fit parameters and to compare different model 
fits using the cross-validated bit/trial quantity defined as the relative value 
of the log likelihood of each model, to the null log likelihood, normalized 
in log2. Removing one parameter by constraining the regression weights 
on the s, stimulus of the current trial plus the weights on previous sensory 
stimuli to add to 1 (constrained model, in red) improved performance on 
cross-validated data compared to the unconstrained model (in black). 

A total of 12 different variants of the model are compared. Regressors are 
described in the box. b, Mean value of cross-validated bit/trial for different 
variants of the model as in a, over n = 20 rats. Error bars show s.e.m. 
Unconstrained models are shown in black, constrained models are shown 
in red. c, Top, raster plots of Ww, versus W;' (t=1, 2, from model 9). Each 
dot represents a subject. Pearson correlation values (r), and corresponding 
two-sided P values are shown for each plot. Bottom, median value of Ww,’ 
and Ww; (t= 1, 2), across rats. Error bars show median absolute deviation. 
d, Similar to c, for human subjects (auditory and tactile tasks are pooled 
together). Similar to rat subjects, model 9 shows the best performance for 
human subjects as well (data not shown). e, To compare the sensory- 
history matrix from the real data to the ones predicted from the best 
model fits (Fig. 2f, g), Frobenious distance norm was used, defined as the 


— Model #9 
(rats) 
Win-stay/ —_- reward left or right 1 
Lose-shitf 9g sensory 
B : bias history 
: i ect-side 
Preseverance : went left or right peo i 
0 


square root of the sum of the absolute squares of the difference between 
elements of two matrices. Frobenious distance is a measure of similarity, 
and the smaller the value, the more similar the two matrices. Frobenious 
distance is calculated separately for individual rats and each bar shows its 
mean value over n= 20 rats. Error bars show s.e.m. Models are models 
A-F from Fig. 2e. f, Scatter plot of slopes from linear fits to percentage 
leftward bias (Fig. 2a) versus short-term sensory history (that is, sum of 
weights for s;', s,', s,” and s,”) from model 9. This plot shows significant 
correlation between the two measurements (Pearson correlation, 

r= —0.66, two-sided P= 0.0084, n = 17 rats), suggesting that when our 
logistic fit coefficients are particularly large, the subjects also have a 
particularly large contraction bias. g, Examining the weights in regression 
model 9, which is determined to be the best model, shows that the weights 
for sensory-history terms are significantly larger than those for the 
correct-side history term (paired-sample t-test, P< 0.0001, n = 22 rats). 
Data from individual rats are used to fit the model and bars show the mean 
value of sensory-history weights (in blue), and correct-side history weight 
(in green), over fit values from n = 22 rats. Error bars show s.e.m. 
Moreover, the sensory history regressor term, that is, sum of sensory- 
history weights x regressors produces larger variance over trials (0.38) 
compared to the correct-side regressor (0.11), indicating a bigger impact 
on trial-by-trial behaviour. 
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a. Acute recordings during photoinhibition b. Electrode traces 
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Extended Data Figure 7 | Physiological and histological confirmations. 1,500, 3,000, 6,000 or 8,000 ms. The laser turning off is indicated by the 
a, Physiological confirmation of optogenetic inactivation effect in an vertical red dashed line. Recordings continued for 2s after the laser was 
anesthetized rat. Left, single trace of acute extracellular activity of an turned off. b, Histological localization of electrodes targeting the PPC. 
example cell in the PPC, expressing eNpHR3.0, is shown in response The inset shows an example of electrode locations in a coronal slice at 
to light stimulation. Laser illumination period (8 s) is marked by the anteroposterior = 3.48 from the bregma. In all cases, the electrode and 
light green bar. Right, raster-plot for 32 trials, for variable durations of fibre placements in the PPC were within between 2.8 and 4 mm posterior 
light stimulation. The green vertical dashed line indicates the start of the bregma and between 2 and 3.5 mm lateral to the midline. Atlas panel is 
the laser illumination. The laser was on for variable durations of 750, taken from Paxinos and Watson, 2004 (ref. 31). 
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a. Optogenetics - History Matrix (previous Left choice, all rewarded) 
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Frobenious Dist: 55.42 (laser on vs laser off) 
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b. Optogenetics - History Matrix (previous Right choice, all rewarded) 
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Extended Data Figure 8 | Optogenetics: PPC inhibition reduces 
leftward bias owing to past sensory stimuli. a, Sensory-history matrix 
and leftward biases due to past sensory stimuli, similar to Fig. 2a—c, but 
now for three types of trials: laser-off trials (two leftmost panels) that 
consist of trials with no PPC inactivation on either the current or the 


trials. This last set controls for number of trials, as it contains equal 
numbers of trials to the laser-on condition. Modulation along the vertical 
indicates a previous trial effect behavioural bias as a function of the stimuli 
of the previous trial, for trials for which rats went left, and were rewarded, 
therefore history of reward and choice is held fixed. Grey lines are different 
previous trial; laser-on trials (two middle panels) that consist of trials with current trial (sq, sp) pairs, the black line is the average over pairs. b, Similar 
PPC inactivation on the current trial; and laser-off-after-laser-on trials to a, for trials for which rats went right and were rewarded. c, Similar toa 
(two rightmost panels) that consist of trials immediately after the laser-on _for all combinations of current and previous stimuli. 
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Extended Data Figure 9 | Optogenetics: impact on contraction bias on 
the full stimulus set, individual data points and best fit parameters for 
non-sensory-history weights. a, Stimulus set and performance during 
optogenetic inhibition sessions, averaged over 37 sessions from 3 rats 
(delay interval of 2s). Trials are grouped based on laser-off (left) and 
laser-on (right) conditions. The boxes represent the set of (s,, sp) pairs 
used in a session, with the colour representing the percentage that went 
left and the numbers above each box indicating the percentage correct. 
The plot in the bottom shows the difference between laser-off and laser-on 
conditions, with positive values indicating improved performance in 
laser-on conditions and negative values indicating impaired performance. 
b, c, Similar to Fig. 3d-f, with all data points overlaid on the bar plots. 


For b, n= 37 for each bar plot (equal to the total number of inactivation 
sessions); for c, n= 600, from 200 iterations of threefold cross-validation 
data; *P < 0.01 from one sided f-test. d, Best-fit parameter values for all 
weights from the nine-parameter model (short-term sensory-history 
model, constrained version, Fig. 2d, e). Values are plotted as their mean 
once the average value from the laser-off condition is subtracted. Except 
for the sensory history, none of the other weights were significantly 
affected by optogenetic inactivation of the PPC. Error bars show s.d. 
(n= 600, 200 iterations of threefold cross-validation; *P < 0.01 from one 
sided t-test). e, Similar to d, for period-selective optogenetic inhibitions, 
in which the PPC is selectively inhibited during the first stimulus s, (left), 
delay interval (middle) or second stimulus sp (right). 
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Extended Data Figure 10 | Mutual information. a, Sensory-history 
coding, one trial back, population analysis, each row represents the time 
course of significant values of mutual information between the firing rate 
of a cell and the stimulus pair (s,, s,) presented on the previous trial. Data 
from all trials with variable delay duration (minimum of 2 s) were pooled 
and plots are aligned to the beginning of s,. Data from n=5 rats, and only 
cells with significant values of mutual information values are included. 
When estimating the mutual information, spurious information values 
can be attributed to the inherent correlations between task parameters, 
such as sensory stimuli and choice. To overcome this, conditional mutual 
information was calculated only when trials with same previous choice 
and reward status were considered, and sensory inputs were the only 
variable. Left, on the previous trial rats went right and were rewarded. 
Right, on the previous trial rats went left and were rewarded. b, Sensory- 
history coding, one trial back, percentage of cells with significant coding 
of stimuli presented on the previous trial (trial i — 1), aligned to the start 
of trial i. Only trials with a delay interval larger than 4 s are included in 
this analysis. c, Sensory-history coding, two trials back, percentage of 
cells with significant coding of stimuli presented two trials in the past 
(trial i — 2), aligned to the start of trial i. Shaded horizontal areas show 
the mean +s.d. of the percentage of neurons with significant mutual 
information (MI), calculated from random sets built by shuffling the firing 
rates of neurons and conditions. d, Percentage of cells with significant 
coding of a rat’s choice and reward status, on both the current trial (solid 
lines) and previous trial (dashed lines), when time is aligned to the 
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current trial, either s, (left), or s, (right). Shaded horizontal areas show 
the mean +s.d. of the percentage of neurons with significant mutual 
information, calculated from random sets built by shuffling the firing 
rates of neurons and conditions. e, In the standard stimulus set (Fig. 1b, 
(sa, Sp) pairs along the diagonal lines), knowledge of the rat’s choice of side, 
whether it was rewarded or not, and one of either s, or sp enables unique 
identification of the other stimulus (s, or s,). Therefore, in order to probe 
whether neurons carried information for different values of s, itself (as 
opposed to a combination of choice, reward and s,), we ran recording 
sessions with psychometric stimuli added to the standard stimulus set 
(top left). In this way, three different values of s, are assigned to one fixed 
value of sp and one fixed action (left in different shades of red, and right in 
different shades of blue). The firing rate of an example neuron is shown in 
response to different values of s,, only for trials in which the rat responded 
by going left (middle graph) or right (right graph) after the ‘go’ cue, was 
rewarded, and the delay interval was 4 s. Even though choice, reward 

and sp are fixed, firing rates clearly differentiate values of s,. The bottom 
graph shows a summary of population analysis from psychometric 
recording sessions (as in the examples in the graphs above), showing the 
percentage of cells with significant coding of s, from trial i (red) or trial 
i— 1 (blue, n= 142 cells). Shaded horizontal areas show the mean +s.d. of 
the percentage of neurons with significant mutual information, calculated 
from random sets built by shuffling the firing rates of neurons and 
conditions. 
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c-MAF-dependent regulatory T cells mediate 
immunological tolerance to a gut pathobiont 


Mo Xu!*, Maria Pokrovskii', Yi Ding’, Ren Yi>, Christy Au!‘, Oliver J. Harrison®, Carolina Galan!, Yasmine Belkaid>®, 


Richard Bonneau**”* & Dan R. Littman! 


Both microbial and host genetic factors contribute to the 
pathogenesis of autoimmune diseases'~*. There is accumulating 
evidence that microbial species that potentiate chronic 
inflammation, as in inflammatory bowel disease, often also 
colonize healthy individuals. These microorganisms, including 
the Helicobacter species, can induce pathogenic T cells and are 
collectively referred to as pathobionts*®. However, how such 
T cells are constrained in healthy individuals is not yet understood. 
Here we report that host tolerance to a potentially pathogenic 
bacterium, Helicobacter hepaticus, is mediated by the induction of 
ROR~ttFOXP3* regulatory T (iT,-g) cells that selectively restrain 
pro-inflammatory T helper 17 (Ty17) cells and whose function is 
dependent on the transcription factor c-MAF. Whereas colonization 
of wild-type mice by H. hepaticus promoted differentiation of 
ROR~t-expressing microorganism-specific iT,., cells in the large 
intestine, in disease-susceptible IL-10-deficient mice, there was 
instead expansion of colitogenic Ty17 cells. Inactivation of c-MAF 
in the T,.g cell compartment impaired differentiation and function, 
including IL-10 production, of bacteria-specific iT,., cells, and 
resulted in the accumulation of H. hepaticus-specific inflammatory 
Ty17 cells and spontaneous colitis. By contrast, RORt inactivation 
in T,eg cells had only a minor effect on the bacteria-specific T,.. and 
TyI17 cell balance, and did not result in inflammation. Our results 
suggest that pathobiont-dependent inflammatory bowel disease is 
driven by microbiota-reactive T cells that have escaped this c-MAF- 
dependent mechanism of iT,eg—Ty17 homeostasis. 

We chose H. hepaticus as a model to investigate host-pathobiont 
interactions. Blockade of IL-10RA induced inflammation of the large 
intestine in H. hepaticus-colonized 1123rGFP reporter mice”®, increas- 
ing the proportion of green fluorescent protein-positive (GFP*) cells 
(predominantly T}17 cells) from approximately 10% to 50% of large 
intestine CD4* T cells (Extended Data Fig. 1a). We therefore sought 
to determine why H. hepaticus-induced T cells do not cause disease 
in wild-type mice at the steady state. To address this question, we first 
identified the T cell receptor (TCR) sequences and cognate epitopes of 
H. hepaticus-induced Ty17 cells that expand during inflammation, and 
subsequently traced the fate of these cells at the steady state. 

We cloned individual TCR sequences from colitogenic IL-23R-GFP* 
T cells (Extended Data Fig. 1b) and found that nine out of twelve clono- 
typic TCRs were H. hepaticus-specific (Extended Data Fig. 1c). We 
subsequently identified”* a H. hepaticus-unique protein, HH_1713, 
that contains two immunodominant epitopes. The El peptide epitope, 
presented by I-A®, was recognized by H. hepaticus-specific TCR HH5-1, 
whereas E2 was recognized by TCR HH5-5, HH6-1 and HH7-2 
(Extended Data Fig. 1c). We next developed two complementary 


approaches to track H. hepaticus-specific T cells in vivo®!, HH7-2 and 
HH5-1 TCR transgenic mice (HH7-2tg and HH5-1tg) and a major 
histocompatibility complex (MHC) class II tetramer loaded with E2 
peptide (HH-E2 tetramer) (Extended Data Fig. 1d-g). 

To track what happens to H. hepaticus-specific T cells in healthy 
mice, we simultaneously transferred naive T cells from HH7-2tg and 
7B8tg (segmented filamentous bacteria (SFB)-specific TCRtg control)® 
mice into wild-type mice that were stably colonized with H. hepaticus 
and SFB (Fig. 1a). Two weeks after adoptive transfer, HH7-2tg donor 
cells were enriched in the large intestinal lamina propria (LILP) and 
caecal patch, whereas 7B8tg cells predominated in the small intestinal 
lamina propria (SILP) and Peyer's patches (Extended Data Fig. 2a, b), 
consistent with colonization of H. hepaticus in the large intestine and 
SEB in the small intestine. As previously reported, 7B8 cells developed 
into T}17 cells that were largely positive for RORyt and negative for 
FOXP3$ (Fig. 1b, c, Extended Data Fig. 2c, d). By contrast, HH7-2tg 
cells in the LILP were mostly iTyeg cells that express both RORyt and 
FOXP3 (approximately 60% of total donor-derived HH7-2tg cells)'?"?, 
rather than Ty17 cells (less than 10% of total HH7-2tg cells) (Fig. 1b, c, 
Extended Data Fig. 2c, d). Notably, two other colonic Treg cell markers, 
GATA3 and ST2, were not expressed on HH7-2tg cells! (Extended 
Data Fig. 2e). 7B8tg and HH7-2tg T cells that expressed neither RORyt 
nor FOXP3 were mostly T follicular helper (Tpy) cells and were 
enriched in the Peyer’s patches and caecal patch (Fig. 1b, c, Extended 
Data Fig. 2c, d). Breeding HH7-2tg mice onto the Rag] ~’~ background 
excluded the possibility that HH7-2tg iT,eg cells detected after adop- 
tive transfer were contaminated by thymus-derived natural Treg (nTyeg) 
cells or were influenced by the presence of dual TCRs (Extended Data 
Fig. 3a—c). Adoptively transferred HH5-1tg and HH-E2-tetramer- 
positive cells had differentiation profiles similar to HH7-2tg cells 
(Fig. 1d, e, Extended Data Figs 2fand 3d, e). These results indicate that 
the host responds to H. hepaticus by generating immunotolerant iTyeg 
cells rather than pro-inflammatory Ty17 cells. 

To examine whether the iT,., cell-dominant differentiation of 
H. hepaticus-specific T cells is altered during intestinal inflammation, 
we co-transferred naive HH7-2tg and control 7B8tg T cells into 
colonized I110~/~ recipients. Notably, only a small proportion of the 
transferred HH7-2tg T cells expressed FOXP3 in the LILP. Instead, 
most of them differentiated into pro-inflammatory Ty17 cells with 
Ty1-like features, characterized by the expression of both RORyt 
and T-bet and high levels of IL-17A and IFNy upon re-stimulation'* 
(Fig. 2a—f, Extended Data Fig. 4a, c, d). These results were recapitu- 
lated with HH5-1tg T cell adoptive transfer and endogenous HH-E2 
tetramer™ T cells (Extended Data Fig. 4e-g). By comparison, disrup- 
tion of IL-10-mediated immune tolerance did not result in deviation of 
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Figure 1 | H. hepaticus induces ROR tt Treg and Try cell responses 
under steady state. a, Experimental scheme for co-transfer of congenic 
isotype-labelled HH7-2tg and 7B8tg cells into wild-type (WT) mice 
colonized with H. hepaticus and SFB. b, c, RORyt, FOXP3, BCL6 

and CXCRS5 expression (b) and frequencies of Tyeg (F OXP3*), Ty17 
(FOXP3” RORt*) and Try (BCL6*CXCR5*) (c) among donor-derived 
T cells in indicated tissues. Data are from 1 of 3 experiments, with 

n= 15 in the 3 experiments. CP, caecal patch; PP, Peyer’s patches. APC 


SFB-specific Ty17 cells to the inflammatory Ty17-Ty] cell phenotype 
(Fig. 2c, d, Extended Data Fig. 4a—d). Furthermore, we observed similar 
deviated T cell responses to H. hepaticus in models of T cell transfer coli- 
tis and Citrobacter rodentium-induced colonic inflammation, but not in 
dextran sulfate sodium (DSS) colitis, an innate immunity-dependent 
model (Extended Data Fig. 5a~h). Commensal microorganism- 
specific T cells can thus acquire pro-inflammatory phenotypes 


(allophycocyanin), Cy5.5, eF780, FITC, PE (phycoerythrin) and PerCP 
denote fluorochrome labels. d, e, Wild-type mice (n= 6) were colonized 
with H. hepaticus for 3-4 weeks and analysed for RORyt, FOXP3 and 
BCL6 expression in total CD4* (red) and HH-E2 tetramer* (blue) T cells 
from the LILP (d) and frequencies of Treg (FOXP3*), TH17 (FOXP3” RORt*) 
and Tp (BCL6*) cells among HH-E2 tetramer* T cells in the LILP and 
caecal patch (e). Data summarize two independent experiments. 


during enteric infection!’, although the high frequency of such 
infections suggests the existence of a mechanism to re-establish gut 
tolerance. Our observations of H. hepaticus-specific iTeg~Ty17 skewing 
during colitis are consistent with a contemporaneous study using two 
different Helicobacter species'®. These findings indicate that dysregu- 
lated T cell tolerance to pathobionts may be a general hallmark of 
inflammatory bowel disease. 
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frequencies of T-bet expression among Ty17 
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four independent experiments. e, f, IL-17A 
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Figure 3 | c-MAF is required for the differentiation and function of 
induced T,eg cells in the gut. a, b, Transcription factor staining in total 
CD4* (a) and HH-E2 tetramer® (b) T cells from the LILP of indicated 
mice. Left, RORyt and FOXP3 expression. Right, frequencies of indicated 
Treg (FOXP3*) and Ty17 (FOXP3~ ROR}t*) cell subsets. Mice were 
colonized with H. hepaticus for 5-6 weeks before analysis. Data summarize 
3 independent experiments for Rorc*"8 (n =7) and littermate controls 
(ctrl; n=7 for a, n= 6 for b), and 4 independent experiments for Mafs""8 
(n= 10) and littermate controls (n = 8). c—e, Co-transfer of Mafs"8 and 
control HH7-2tg T cells into wild-type H. hepaticus-colonized mice. 

c, Left, donor cell composition in the LILP of recipient mice. Right, 
RORyt and FOXP3 expression in indicated donor-derived cells. d, Ratios 
of Maf*"" (knockout, KO) versus control (WT) HH7-2tg donor-derived 
cells in the LILP. Dashed line represents ratio of co-transferred cells before 
transfer. e, Frequencies of Treg (FOXP3*) and Ty17 (FOXP3” RORYt*) 


We next wished to determine whether RORyt* Treg Cells are criti- 
cal for immune tolerance to gut pathobionts. The transcription 
factor c-MAF attracted our attention as it was highly enriched in 
these cells'+!” (Extended Data Fig. 6a) and known to promote an anti- 
inflammatory program, for example, IL-10 expression in other T helper 
cell subsets'*°. We therefore deleted Maf with Fox, ee to test its func- 
tion in Treg cells. In H. hepaticus-colonized Maf™'; Foxp3°” (Maf*""®) 
mice, despite incomplete depletion of c-MAF protein (Extended Data 
Fig. 6b), there was a marked decrease in the proportion of RORytT 
but not RORyt~ Treg cells among CD4* T cells in the large intestine, 
and a concomitant increase in Ty17 cell frequency (Fig. 3a). Maf*1"8 
mice also had expanded numbers of total CD4* T cells in the large 
intestine, reflected by a pronounced accumulation of Ty17 cells, but 
notably not RORyt* Treg Cells (Extended Data Fig. 6c). By contrast, 
after H. hepaticus colonization, Ty17 cell expansion was less striking in 
Rord™!!, Foxp3 ® (Rorc4 8) mice (Fig. 3a, Extended Data Fig. 6c), and 
neither a decrease of RORytT Treg cells nor an expansion of Ty17 cells 
was observed in Gata3""";Foxp3 (Gata3°"8) mice (Extended Data 
Fig. 6d, e). The altered frequency of RORYt* Treg and Ty17 cell subsets 
led us to test whether the fate of H. hepaticus-specific T cells would be 
affected in the Maf*1"8 and Rorc*!8 mice. Notably, HH-E2 tetramer* 
cells were predominantly Ty17 in Maf*"”8 mice, but mostly RORyt* 
Treg in control mice (Fig. 3b, Extended Data Fig. 6f). By contrast, although 
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cells among donor-derived cells. Data are a summary of 10 mice from 2 
independent experiments. a, b, e, Statistics were calculated by unpaired 
two-sided Welch’s t-test. Error bars denote mean + s.d. P values are as 
follows: a, i= 1.21 x 107%, ii=8.82 x 1077, iti = 0.016, iv=6.38 x 10-4, 
v= 8.06 x 10-4 and vi= 9.89 x 1077. b, i=0.056, ii=7.48 x 10-4, 

iii = 7.64 x 1077 and iv=6.01 x 107°. e, i=6 x 10°. NS=0.86. 

f, MA plot depicting RNA-seq comparison of donor naive T cell-derived 
Maf*" versus control FOXP3-YFP* iTyeg cells (mean of two biologically 
independent experiments). Blue dots indicate 190 upregulated and 75 
downregulated genes in the c-MAF-dependent signature. Highlighted 
blue dots represent downregulated genes related to Treg cell function, and 
highlighted red dots indicate genes that are also dependent on ROR t!!. 
Differentially expressed genes were calculated in DESeq2 using the Wald 
test with Benjamini-Hochberg correction to determine the false discovery 
rate (FDR) (FDR< 0.1 and log) fold change > 1.5). 


Rorc*"*8 mice also had an increased proportion of H. hepaticus- 
specific Ty17 cells, most tetramer* cells were Treg cells (Fig. 3b, 
Extended Data Fig. 6f). Collectively, these results suggest that 
pathobiont-specific ROR t* iTreg cells are required for the suppression 
of inflammatory Ty17 cell accumulation. Although ROR expression 
contributes to gut iT;eg cell function, c- MAF has a more substantial 
role in the differentiation and/or function of these cells. H. hepaticus- 
specific Tpy cell differentiation in the caecal patch did not seem to be 
affected in Maf*""8 mice (Extended Data Fig. 6g). Notably, as in IL-10- 
deficient mice, SFB-specific Ty17 cells neither expanded nor adopted 
a Ty1-like phenotype in Maf*""* mice (Extended Data Fig. 6h, i). 
A potential explanation is that SFB- and H. hepaticus-specific Ty17 
responses are instructed by different innate immune pathways”””!. 

To investigate how c-MAF regulates the gut RORytT Treg -Ty17 
cell axis, we co-transferred equal numbers of naive Maf'!*;Foxp3° 
(control) and Maf™"; Foxp3° HH7-2tg cells into H. hepaticus-colonized 
wild-type mice (Extended Data Fig. 7a, b). Two weeks after adoptive 
transfer, the Maf";Foxp3 HH7-2tg cells were markedly underrepre- 
sented compared to control cells in the LILP, mesenteric lymph nodes 
(mLNs) and spleen, and were unable to form iT;eg cells (Fig. 3c—e 
and Extended Data Fig. 7c, d). Importantly, at homeostasis, mutant 
donor-derived cells did not give rise to a high frequency of Ty17 cells 
(Fig. 3e). Transcriptomics analysis revealed that the c-MAF-deficient 
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Figure 4 | ROR t* iT, eg cells are required to maintain gut homeostasis. 
a, Frequency of rectal prolapse by genotype. b, Left, spleens and mLNs 
from Maf*""® and littermate controls. Right, total cell numbers in mLNs. 
Data summarize three independent experiments for Rorc*"”8 (n = 6) 

and littermate controls (n =7), and four independent experiments for 
Maf "8 (n = 9) and littermate controls (n = 8). c, Number of leukocytes 
in the LILP. Data summarize three independent experiments for Rorc*"*8 
(n=7) and littermate controls (n = 8), and four independent experiments 
for Maf*"”8 and littermate controls (n= 9). d, Representative histology 
of large intestine (LI) sections (left) and colitis scores (right) of mice with 
indicated genotypes. Rorc™8 (n= 8) and littermate controls (n=9). 
Maf*"8 (n= 11) and littermate controls (n = 9). Scale bars, 50 tm. 


iTyeg cells were functionally impaired, as indicated by defective expres- 
sion of I110 and other Tyeg cell signature genes, as well as of RORt- 
dependent genes’ (Fig. 3f, Extended Data Fig. 7e-h). Taken together, 
these findings show that c-MAF is a crucial cell-intrinsic factor for 
both the generation and function of microorganism-specific iT,cg cells. 
Notably, the vast majority of accumulated Tyj17 cells in Maf*1"8 mice 
expressed c-MAF, indicating that the bulk of these cells did not arise 
from Treg cells in which c-MAF was deleted (Extended Data Fig. 7i). 
Thus, suppression of Ty17 cell expansion is mediated by these iTyeg 
cells in trans. 

ROR“ expression in iTyeg cells has been implicated in the main- 
tenance of gut immune homeostasis under different challenges!?”. 
However, spontaneous gut inflammation in Rorc*!”8 mice has not 
been described. We noticed that Maf*", but not Rorc*"8 or con- 
trol littermates, were prone to rectal prolapse (Fig. 4a). Maf*!"8 mice 
(8-12 weeks old) colonized with H. hepaticus for 5-6 weeks had 
enlarged large intestine-draining mLNs, and increased cellularity of 
mLNs and the large intestine (Fig. 4b, c). Histopathological analysis 
of the large intestine of these mice revealed mixed acute and chronic 
inflammation (Fig. 4d). Without H. hepaticus colonization, aged 
(6-12 months old) Maf*""8 mice also exhibited mild spontaneous 
colitis (Fig. 4e). Notably, none of the above changes was observed in 
Rorc 8 mice (Fig. 4b-d). Thus, c-MAF but not ROR‘ expression in 
iTyeg Cells is crucial for the suppression of spontaneous inflammation. 
Indeed, the transcriptional profile of H. hepaticus-specific T effector 
(Terr) cells from Maf*""8 mice with spontaneous colitis was highly 
similar to that of pathogenic Ty17 cells in IL-10RA blockade-induced 
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e, Colitis scores in aged H. hepaticus-negative Maf*""8 (n=6) and 
littermate control (n =7) mice. f, g, Suppression of H. hepaticus-specific 
TCRtg cell-mediated transfer colitis by in vitro differentiated iTreg cells. 
Data summarize two independent experiments with indicated sample size 
(n) in total. Colitis scores for Rag1~/~ mice that received the indicated 
TCRtg naive T cells and iT,eg combinations. All statistics were calculated by 
unpaired two-sided Welch's t-test. Error bars denote mean + s.d. P values 
are indicated on the figure or as follows: f, i= 5.34 x 10-4, ii= 1.24 x 10-7, 
iii = 3.64 x 10-4, iv=3.26 x 10°-* and v= 4.54 x 10-*. NS=0.056. 

g, i=0.013, ii=2.50 x 10°, iti =4.59 x 10-4, iv= 0.024, and v=0.016. 
NS=0.12. 


colitis, but differed markedly from homeostatic Ty17 cells (which are 
predominantly SFB-specific) (Extended Data Fig. 8a-f). 

Similar to the Maf*?"*8 strain, mice with inactivation of STAT3 
in the T,.¢ compartment or impaired TGF@ signalling in CD4* 
T cells also lacked ROR yt* Treg cells and developed spontaneous 
colitis'***3 (Extended Data Fig. 9a, b). Consistent with these find- 
ings, c-MAF expression in Tyeg cells required a combination of both 
TGF@ and STATS signals in vitro and in vivo, as it does in other CD4t 
T cells'®!° (Extended Data Fig. 9a-c). This suggests that c- MAF 
integrates anti-inflammatory TGFS receptor signals with microorganism- 
induced cytokine-dependent STAT3 activation to mediate RORYt* Treg 
induction. 

Although c-MAF is also expressed, albeit at a lower level, in nTyeg 
cells, c-MAF-deficient and -sufficient nT;eg cells showed equiv- 
alent activity in inhibiting T.g cell proliferation in vitro, as well as 
in suppressing pathogenesis in a model of T cell transfer colitis 
in vivo (Extended Data Fig. 10a, b). We therefore wondered why, 
despite their increased numbers (Extended Data Fig. 6c), nTreg cells 
were not sufficient to establish gut homeostasis in Maf*!" mice. 
Adoptive transfer of 1,000 naive HH7-2tg or HH5-1tg cells into 
H. hepaticus-colonized Rag1~/~ mice led to colitis. Taking advantage 
of this system, we compared the suppressive function of iTyeg cells 
differentiated in vitro from naive HH7-2tg, HH5-1tg and polyclonal 
T cells. We found that epitope-specific iT,eg were better at suppress- 
ing colitis, providing a potential explanation for why pathobiont- 
specific iTreg cells are required in addition to nT;2g cells to maintain gut 
homeostasis” (Fig. 4f, g). 
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Our results reveal a mechanism for how a healthy individual can 
host a ‘two-faced’ commensal pathobiont such as H. hepaticus with- 
out developing inflammatory disease. Our findings suggest that Tyeg 
cell induction serves as a strategy to establish commensalism, not only 
by helping the microorganisms to colonize their niche”’, but also by 
protecting the host from inflammation. A similar requirement for iTyeg 
cells has also been reported in the establishment of food tolerance”®. 
Our observations in Maf*"" mice are linked to and help explain the 
expansion of colitogenic Ty17 cells in mice with T,..-specific inacti- 
vation of STAT3”’. Like c-MAK, STAT3 is probably required for the 
differentiation and/or function of microbiota-induced RORytt iTyeg 
cells'”. Moreover, microorganism-specific iT, eg cells, compared with 
non-specific nTyeg cells, can better suppress inflammatory Ter cells by 
recognizing the same epitopes. This result raises the prospect of har- 
nessing the mechanisms of pathobiont-specific iT, eg cell responses to 
re-establish homeostasis in patients with inflammatory bowel disease, 
for example, by engineering non-pathogenic Tyeg cell-inducing micro- 
organisms”’ to express pathobiont antigens. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Mice. Mice were bred and maintained in the animal facility of the Skirball Institute 
(New York University School of Medicine) and the National Institute of Allergy and 
Infectious Diseases (NIAID) in specific pathogen-free conditions. C57BL/6 mice 
were obtained from Jackson Laboratories or Taconic Farm. I110~/~ (B6.129P2- 
I110""!°8"/7) mice were purchased from Jackson Laboratories and bred with wild- 
type C57BL/6 mice, which subsequently generated I110*/~ and I110~'~ littermates 
by heterozygous breeding. CD4-dnTGFbRII mice” were purchased from Jackson 
Laboratories, and bred with wild-type C57BL/6 mice to generate CD4-dnTGFbRII 
and wild-type littermates. Cd4°" (Tg(Cd4-cre)1Cwi/BfluJ) and Cd45.1 (B6.SJL- 
Ptprca Pepcb/BoyJ) mice were purchased from Jackson Laboratories. Foxp3?Y!? 
mice were previously described and obtained from Jackson Laboratories”®. [123r8? 
and Maf" strains were previously described”’®° and provided by M. Oukka and C. 
Birchmeier, respectively. Stat3";Cd4* mice were provided by D. E. Levy. Gata3™", 
Foxp3‘’*¥? mice were bred at the NIAID. Littermates with matched sex (both 
males and females) were used. Except the aged mice (6-12 months old) analysed 
in the experiments of Fig. 4e, mice in all the experiments were 6-12 weeks old at 
the starting point of treatments. Animal sample size estimates were determined 
using power analysis (power = 90% and a= 0.05) based on the mean and standard 
deviation from our previous studies and/or pilot studies using 4-5 mice per group. 
All animal procedures were performed in accordance with protocols approved by 
the Institutional Animal Care and Usage Committee of New York University School 
of Medicine or the NIAID as applicable. 

Antibodies, intracellular staining and flow cytometry. The following monoclonal 
antibodies were purchased from eBiosciences, BD Pharmingen or BioLegend: CD3 
(145-2C11), CD4 (RM4-5), CD25 (PC61), CD44 (IM7), CD45.1 (A20), CD45.2 
(104), CD62L (MEL-14), CXCR5 (L138D7), NPR-1 (3E12), ST2 (RMST2-2), 
TCR®B (H57-597), TCR VB6 (RR4-7), TCR V8.1/8.2 (MR5-2), TCR V314 (14-2), 
BCL6 (K112-91), c-MAF (T54-853), FOXP3 (FJK-16s), GATA3 (TWAJ), Helios 
(22F6), RORyt (B2D or Q31-378), T-bet (eBio4B10), IL-10 (JES5-16E3), IL-17A 
(eBiol7B7) and IFNy (XM61.2). 4’,6-diamidino-2-phenylindole (DAPI) or Live/ 
dead fixable blue (ThermoFisher) was used to exclude dead cells. 

For transcription factor staining, cells were stained for surface markers, 
followed by fixation and permeabilization before nuclear factor staining according 
to the manufacturer’s protocol (FOXP3 staining buffer set from eBioscience). For 
cytokine analysis, cells were incubated for 5h in RPMI with 10% FBS, phorbol 
12-myristate 13-acetate (PMA) (50ng ml}; Sigma), ionomycin (500 ng ml}; 
Sigma) and GolgiStop (BD). Cells were stained for surface markers before fixa- 
tion and permeabilization, and then subjected to intracellular cytokine staining 
according to the manufacturer's protocol (Cytofix/Cytoperm buffer set from BD 
Biosciences). 

Flow cytometric analysis was performed on an LSR II (BD Biosciences) or an 
Aria II (BD Biosciences) and analysed using FlowJo software (Tree Star). 
Isolation of lymphocytes. Intestinal tissues were sequentially treated with PBS 
containing 1 mM DTT at room temperature for 10min, and 5mM EDTA at 37°C 
for 20 min to remove epithelial cells, and then minced and dissociated in RPMI 
containing collagenase (1 mg ml! collagenase II; Roche), DNase I (100,.g ml’; 
Sigma), dispase (0.05 U ml '; Worthington) and 10% FBS with constant stirring at 
37°C for 45 min (small intestine) or 60 min (large intestine). Leukocytes were col- 
lected at the interface of a 40%/80% Percoll gradient (GE Healthcare). The Peyer's 
patches and caecal patch were treated in a similar fashion except for the first step of 
removal of epithelial cells. Lymph nodes and spleens were mechanically disrupted. 
Single-cell TCR cloning. [/23r°'”’+ mice were maintained in SFB-free conditions 
to guarantee low Ty17 background levels. To induce a robust Ty17 cell response, 
the mice were orally infected with H. hepaticus and injected intraperitoneally with 
1 mg anti-IL-10RA (clone 1B1.3A, Bioxcell) every week from the day of infection. 
After two weeks, large intestine GFP* CD4* T cells were sorted on the BD Aria 
I and deposited at one cell per well into 96-well PCR plates pre-loaded with 5 il 
high-capacity cDNA reverse transcription mix (Thermo Fisher) supplemented 
with 0.1% Triton X-100 (Sigma-Aldrich). Immediately after sorting, whole plates 
were incubated at 37°C for 2h, and then inactivated at 85°C for 10 min for cDNA 
preparation. A nested multiplex PCR approach described previously was used 
to amplify the CDR3a and CDR38 TCR regions separately from the single cell 
cDNA®!. PCR products were cleaned up with ExoSap-IT reagent (USB) and 
Sanger sequencing was performed by Macrogen. Open reading frame nucleotide 
sequences of the TCRa and TCR families were retrieved from the IMGT database 
(http://www.imgt.org)?. 

Generation of TCR hybridomas. The NFAT-GFP 58°37 hybridoma cell line 
was provided by K. Murphy’. To reconstitute TCRs, CDNA of TCRa and TCR8 
were synthesized as gBlocks fragments by Integrated DNA Technologies (IDT), 
linked with the self-cleavage sequence of 2A (TCRa-p2A-TCR8), and shuttled into 
a modified MigR1 retrovector in which IRES-GFP was replaced with IRES-mCD4 
(mouse CD4) as described previously’. Then retroviral vectors were transfected 


into Phoenix E packaging cells using TransIT-293 (Mirus). Hybridoma cells were 
transduced with viral supernatants in the presence of polybrene (81g ml!) by 
spin infection for 90 min at 32°C. Transduction efficiencies were monitored by 
checking mCD3 surface expression three days later. 

Assay for hybridoma activation. Splenic dendritic cells were used as 
antigen-presenting cells (APCs). B6 mice were injected intraperitoneally with 
5 x 10° FLT3L-expressing B16 melanoma cells to drive APC proliferation as previ- 
ously described™. Splenocytes were prepared 10 days after injection, and positively 
enriched for CD11c* cells using MACS LS columns (Miltenyi). 2 x 104 hybridoma 
cells were incubated with 10° APCs and antigens in round bottom 96-well plates 
for two days. GFP induction in the hybridomas was analysed by flow cytometry 
as an indicator of TCR activation. 

Construction and screen of whole-genome shotgun library of H. hepaticus. 
The shotgun library was prepared with a procedure modified from previous 
studies”*. In brief, genomic DNA was purified from cultured H. hepaticus with 
DNeasy PowerSoil kit (Qiagen). DNA was partially digested with MluCI (NEB), 
and the fraction between 500 and 2,000 base pairs (bp) was ligated into the EcoRI- 
linearized pGEX-6P-1 expression vector (GE Healthcare). Ligation products were 
transformed into ElectroMAX DH10B competent Cells (Invitrogen) by electropo- 
ration. To estimate the size of the library, we cultured 1% and 0.1% of transformed 
bacteria on lysogeny broth (LB) agar plates containing 100,1g ml~! Ampicillin 
for 12h and then quantified the number of colonies. The library is estimated to 
contain 3 x 10* clones. To ensure the quality of the library, we sequenced the inserts 
of randomly picked colonies. All the sequences were mapped to the H. hepaticus 
genome, and their sizes were 700 to 1,200 bp. We aliquoted the bacteria into 96-well 
deepwell plates (Axygen) (~30 clones per well) and grew with AirPort microporous 
cover (Qiagen) in 37°C. The expression of exogenous proteins was induced by 
1mM isopropylthiogalactoside (IPTG, Sigma) for 4h. Then bacteria were col- 
lected in PBS and heat-killed by incubating at 85°C for 1h, and stored at —20°C 
until use. Two screening rounds were performed to identify the antigen-expressing 
clones. For the first round, pools of heat-killed bacterial clones were added to 
a co-culture of splenic APCs and hybridomas. Clones within the positive pools 
were subsequently screened individually against the hybridoma bait. Finally, the 
inserts of positive clones were subjected to Sanger sequencing. The sequences were 
blasted against the genome sequence of H. hepaticus (ATCC51449) and aligned to 
the annotated open reading frames. Full-length open reading frames containing 
the retrieved fragments were cloned into pGEX-6P-1 to confirm their activity in 
the T cell stimulation assay. 

Epitope mapping. We cloned overlapping fragments spanning the entire HH_1713 
coding region into the pGEX-6P-1 expression vector, and expressed these in 
Escherichia coli BL21 cells. The heat-killed bacteria were used to stimulate relevant 
hybridomas. This process was repeated until we mapped the epitope to a region 
containing 30 amino acids. The potential MHC class II epitopes were predicted with 
online software RANKPEP”. Overlapping peptides spanning the predicted region 
were further synthesized (Genescript) and verified by stimulation of the hybridomas. 
Generation of TCRtg mice. TCR sequences of HH5-1 and HH7-2 were cloned 
into the pTa and pT® vectors provided by D. Mathis*®. TCR transgenic mice were 
generated by the Rodent Genetic Engineering Core at the New York University 
School of Medicine. Positive pups were genotyped by testing TCR VB8.1/8.2 
(HH5-1tg) or VB6 (HH7-2tg) expression on T cells from peripheral blood. 
MHC class II tetramer production and staining. HH-E2 tetramer was pro- 
duced by the NIH Tetramer Core Facility*”. In brief, QESPRIAAAYTIKGA 
(HH_1713-E2), an immunodominant epitope validated with the hybridoma 
stimulation assay, was covalently linked to I-A? via a flexible linker, to produce 
pMHCII monomers. Soluble monomers were purified, biotinylated, and tetramer- 
ized with phycoerythrin- or allophycocyanin-labelled streptavidin. SFB-specific 
tetramer (3340-A6 tetramer) was described previously*. To stain endogenous 
T cells, mononuclear cells from SILP, LILP or caecal patch were first resuspended in 
MACS buffer with FcR block, 2% mouse serum and 2% rat serum. Then tetramer 
was added (10 nM) and incubated at room temperature for 60 min, and cells were 
re-suspended by pipetting every 20 min. Cells were washed with MACS buffer and 
followed by regular surface marker staining at 4°C. 

Adoptive transfer of TCRtg cells. Recipient mice were colonized with H. hepaticus 
and/or SFB by oral gavage seven days before adoptive transfer (The method for 
oral infection of SFB has been previously described®). Spleens from donor TCRtg 
mice were collected and mechanically disassociated. Red blood cells were lysed 
using ACK lysis buffer (Lonza). For TCRtg mice in wild-type background, naive 
Tg T cells were sorted as CD4*CD3*CD44'°CD62L"CD25~ V86* (HH7-2tg), 
VB8.1/8.2* (HH5-1tg) or VB14* (7B8tg) on the Aria II (BD Biosciences). For HH7- 
2tg mice bred to the Foxp3”°""? background, naive transgenic T cells were sorted as 
CD4*CD3+CD44"°CD62L™FOXP3“"?-V36". Cells were resuspended in PBS on 
ice and transferred into congenic isotype-labelled recipient mice by retro-orbital 
injection. Cells from indicated tissues were analysed two weeks after transfer. 
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H. hepaticus culture and oral infection. H. hepaticus was provided by J. Fox 
(MIT). Frozen stock aliquots of H. hepaticus were stored in Brucella broth with 
20% glycerol and frozen at —80°C. The bacteria were grown on blood agar plates 
(TSA with 5% sheep blood, Thermo Fisher). Inoculated plates were placed into a 
hypoxia chamber (Billups-Rothenberg), and anaerobic gas mixture consisting of 
80% nitrogen, 10% hydrogen, and 10% carbon dioxide (Airgas) was added to create 
a micro-aerobic atmosphere, in which the oxygen concentration was 3-5%. The 
micro-aerobic jars containing bacterial plates were left at 37°C for 5 days before 
animal inoculation. For oral infection, H. hepaticus was resuspended in Brucella 
broth by application of a pre-moistened sterile cotton swab applicator tip to the 
colony surface. The concentration of bacterial inoculation dose was determined 
by the use of a spectrophotometric optical density (OD) analysis at 600 nm, and 
adjusted to OD¢00 nm readings between 1 and 1.5. 0.2 ml bacterial suspension was 
administered to each mouse by oral gavage. Mice were inoculated every 5 days for 
a total of two doses. 

H. hepaticus-specific TCRtg cell-mediated transfer colitis. Naive T (Thaive) 
cells were isolated from the spleens of HH7-2tg mice as CD4+CD3*+CD44”° 
CD62L"'CD25~ VB6+ by FACS. The sorted cells (1 x 10°) were administered 
by retro-orbital injection into H. hepaticus-colonized Ragl~/~ mice. After 
two weeks, cells from the large intestine were isolated and analysed by flow 
cytometry. 

C. rodentium-mediated colon inflammation. C. rodentium strain DBS100 
(ATCC51459; American Type Culture Collection) was used for all inoculations. 
Bacteria were grown at 37°C in LB broth to OD¢00 nm reading between 0.4 and 0.6. 
Mice were inoculated with 200 1l ofa bacterial suspension (1 x 10°-2 x 10” colony- 
forming units (CFU)) by way of oral gavage. After 15 days, cells from the large 
intestine were isolated, stained for HH-E2 tetramer and other markers as indicated 
and analysed by flow cytometry. 

DSS-induced colitis. Mice were colonized with H. hepaticus 5 days before DSS 
treatment. To induce colitis, mice were given 2% DSS (50,000MW, Affymetrix/ 
USB) in drinking water for 2 cycles, with each exposure for 7 days with 5 days 
of untreated water in between. Control mice were given drinking water for the 
same period. Cells from the large intestine were then isolated, stained for HH-E2 
tetramer and other marks as indicated and analyzed by flow cytometry. Animal 
weights were monitored daily during the entire experiment. 

T cell culture. Naive CD4* T cells were purified from spleen and lymph 
nodes of mice with indicated genotypes. In brief, CD4* T cells were posi- 
tively selected from organ cell suspensions by magnetic-activated cell 
sorting using CD4 beads (MACS, Miltenyi) according to the product protocol, 
and then isolated as CD4*CD3*CD44!’°CD62L"CD25~ (polyclonal) or 
CD4*+CD3+CD44!°CD62LhicD25~ VB6+ (HH7-2tg) or V8.1/8.2+ (HH5-1tg) 
by FACS. T cells were cultured at 37°C in RPMI (Hyclone) supplemented with 10% 
heat-inactivated FBS (Hyclone), 50 U penicillin-streptomycin (Hyclone), 2mM 
glutamine (Hyclone), 10mM HEPES (Hyclone), 1 mM sodium pyruvate (Hyclone) 
and 50\1M 3-mercaptoethanol (Gibco). 

To generate iTeg cells for transfer colitis experiments (see below), wild-type, 
HH7-2tg or HH5-1tg cells were seeded at 1 x 10° cells in 1.5 ml per well in 12-well 
plates pre-coated with an anti-hamster IgG secondary antibody (MP Biomedicals), 
and cultured for 72h. The culture was supplemented with soluble anti-CD3< 
(0.25 jug ml“!, Bioxcell, clone 145-2C11) and anti-CD28 (1 bg ml“, Bioxcell, clone 
37.51) for TCR stimulation, and anti-IL-4 (lpg ml“, Bioxcell, clone 11B11), 
anti-IFN* (1j.g ml~!, Bioxcell, clone XMG1.2), human TGF1 (20ng ml}, 
Peprotech), human IL-2 (500 U ml~}, Peprotech) and all-trans retinoic acid 
(100 nM, sigma) for optimal iT; eg cell polarization. Aliquots of cultured cells were 
analysed for intracellular FOXP3 staining by flow cytometry. After they were con- 
firmed to be more than 98% FOXP3*, the remaining live cells (DAPI negative) 
were FACS sorted for adoptive transfer. 

To test the conditions inducing c-MAF expression, 2001] naive T cells iso- 
lated from Maf!", Maf"";Cd4" or Stat3/";Cd4"* mice were seeded at 1 x 10° 
cells per well in 96-well plates pre-coated with the anti-hamster IgG second- 
ary antibody, and cultured for 48h. The culture was supplemented with solu- 
ble anti-CD3e (0.25 ,1g ml“) and anti-CD28 (1,.g ml“) for TCR stimulation. 
Combinations of the following antibodies or cytokines were added as indicated 
in Extended Data Fig. 9c: anti- TGF@ (1 jg ml"!, Bioxcell, 1D11.16.8), human 
TGF1 (0.3 or 20ng ml !, Peprotech), human IL-2 (500 U ml!, Peprotech), 
mouse IL-6 (20ng ml~!, Thermo), mouse IL-10 (100 ng ml, Peprotech), mouse 
IL-27 (25ng ml~!, Thermo), mouse IL-12 (10 ng ml}, Peprotech), mouse IL-1 
(10ng ml |, Peprotech), mouse IL-4 (10 ng ml !, R&D systems), mouse IFN’ 
(10ng ml“!, Peprotech), and all-trans retinoic acid (100 nM, sigma). 

Treg cell in vitro suppression assay. Thaive cells with the phenotype CD4+CD3* 
CD44"CD62L"'CD25~ were isolated from the spleen and lymph nodes of 
CD45.1 wild-type B6 mice by FACS and labelled with carboxyfluorescein 
diacetate succinimidyl ester (CFSE). nTyeg (CD45.2) cells with the phenotype 
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CD4*CD3*FOXP3"°*'? +NRP1* were isolated from the spleen and lymph 
nodes of Foxp3**? or Maf*"*8 mice by FACS. B cells were isolated from the 
spleen and lymph nodes of CD45.2 wild-type B6 mice by positive enrichment for 
B220* cells using MACS LS columns (Miltenyi). 2.5 x 10* CFSE-labelled Thaive 
cells were cultured for 72h with B cell APCs (5 x 10°) and anti-CD3 (11g ml) 
in the presence or absence of various numbers of nTyeg cells as indicated. The cell 
division index of responder T cells was assessed by dilution of CFSE using FlowJo 
software (Tree Star). 

Suppression of adoptive transfer colitis with T,eg cells. To compare the suppressive 
function of c-MAF-sufficient and -deficient nTyeg cells, CD4*CD3*CD25~ 
CD45RB"™ Tyre cells were isolated by FACS from B6 mouse spleens and 
CD4*CD3*FOXP3-YFP*NPRI* nTyeg were isolated from spleen of H. hepaticus- 
colonized Foxp3°’!?? or Mafs""*8 mice. Teg cells (5 x 10°) were administered by 
retro-orbital injection into H. hepaticus-colonized Rag1~‘~ mice alone, or simul- 
taneously with 4 x 10° nTyeg cells as previously described**. Animal weights were 
measured weekly. 

To compare the suppressive function of TCRtg and polyclonal Treg cells, Thaive cells 
with the phenotype CD4+CD3*CD44"CD62L"CD25~ and V6+ (HH7-2tg) or 
CD4*CD3+CD44"°CD62L"CD25~ and V88.1/8.2* (HH5-1tg) were isolated from 
spleens of TCRtg mice. 1,000 naive HH7-2tg or HH5-1tg cells were co-transferred 
with different numbers (500,000, 150,000 or 50,000 as indicated in Fig. 4f, g) 
of in vitro polarized (see T cell culture, above) HH7-2tg, HH5-1tg or polyclonal 
iTyeg cells into H. hepaticus-colonized Rag1~!~ mice by retro-orbital injection. 

Co-housed littermate recipients were randomly assigned to different treatment 

groups such that each cage contained all treatment conditions. After four to five 
weeks (for nT;eg cell comparisons) or eight weeks (for the transgenic T cell compar- 
isons), large intestines were collected and fixed with 10% neutral buffered formalin 
(Fisher). Samples were sectioned and stained with haematoxylin and eosin (H&E) 
by the Histopathology Core at the New York University School of Medicine. 
Histology analysis. The H&E slides from each sample were examined in a blinded 
fashion. Samples of proximal, mid and distal colon were graded semi-quantitatively 
from 0 to 4 as described previously*’. Scores from proximal, mid and distal sites 
were averaged to obtain inflammation scores for the entire colon. 
Cell isolation for RNA-seq experiment. A T cell reconstitution system was 
designed to purify c-MAF-sufficient or -deficient iT;eg cells from compatible 
microenvironments. In brief, Thaive cells were isolated from the spleen of 
CD45.2 Foxp3°’*? or Maf*"*8 mice as CD4*CD3+CD44"°CD62L"FOXP3- 
YFP and Tyeg cells were isolated from the spleens of CD45.1 wild-type B6 mice 
as CD4*CD3*CD25* by FACS. Thaive cells (2 x 10°) and Treg cells (8 x 10°) 
were simultaneously administered by retro-orbital injection into H. hepaticus- 
colonized Ragl ~'~ mice. Two weeks after transfer, c-MAF-sufficient or -deficient 
iTyeg cells were purified from the large intestine of reconstituted mice as 
CD4*CD3*CD45.1~- CD45.2*FOXP3-YFP* by FACS and collected into FBS. 
20% of the sorted cells were stained for RORyt and FOXP3 and the remaining 
cells were saved in TRIzol (Invitrogen) for RNA extraction. 

To isolate H. hepaticus-specific colitogenic Tes cells, HH7-2tg;Maf*"8 and 
HH7-2tg;Foxp3 mice were colonized with H. hepaticus. HH7-2tg;Foxp3° 
mice were further injected intraperitoneally with 1 mg anti-IL-10RA antibody 
(clone 1B1.3A, Bioxcell) weekly from the day of colonization. HH7-2 Ter cells 
(CD3*CD4*TCRV6*FOXP3~ YEP” ) were sorted from the LILP two weeks after 
colonization. Homeostatic IL-23R-GFP* T cells (CD3*CD4*IL-23R-GFP*) were 
sorted from both SILP and LILP of 1/23r°""* mice stably colonized with SFB. 
RNA-seq library preparation. Total RNA was extracted using TRIzol (Invitrogen) 
followed by DNase I treatment and cleanup with RNeasy MinElute kit (Qiagen). 
Treg cell RNA-seq libraries were prepared with the SMART-Seq v4 Ultra Low Input 
RNA Kit (Clontech 634899 and 634888). T}y17 cell RNA libraries were prepared 
using the Nugen Ovation Ultralow Library Systems V2 (7102 and 0344). All 
sequencing was performed using the Illumina NextSeq. RNA-seq libraries were 
prepared and sequenced by the Genome Technology Core at New York University 
School of Medicine. 

Data processing of RNA-seq. RNA-seq reads were mapped to the Mus musculus 
genome Ensembl annotation release 87 with STAR (v2.5.2b)*°. Uniquely mapped 
reads were counted using featureCounts" with parameters: -p -Q 20. DESeq2” was 
used to identify differentially expressed genes across conditions with experimental 
design: ~condition + gender. Read counts were normalized and transformed by 
functions varianceStabilizingTransformation and rlog in DESeq2 with the follow- 
ing parameter: blind=FALSE. Gender differences were considered as batch effect, 
and were corrected by ComBat**. Downstream analysis and data visualization 
were performed in R“. 

Statistical analysis. For animal studies, mutant and control groups did not always 
have similar standard deviations and therefore an unpaired two-sided Welch's t-test 
was used. Error bars represent + s.d. Animal sample size estimates were determined 
using power analysis (power = 90% and a = 0.05) based on the mean and s.d. 
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from our previous studies and/or pilot studies using 4-5 mice. No samples were 
excluded from analysis. For RNA-seq analysis, differentially expressed genes were 
calculated in DESeq2 using the Wald test with Benjamini-Hochberg correction to 
determine the FDR. Genes were considered differentially expressed with FDR < 0.1 
and log, fold change > 1.5. Enriched disease pathways in pathogenic HH7-2 Ty17 
were determined using Ingenuity Pathway Analysis (https://www.ingenuity.com). 
Gene set enrichment analysis (GSEA, http://www.broad.mit.edu/gsea/) on Maf- 
deficient or -sufficient iT;eg cells was performed using a gene set of 33 RORt- 
dependent genes in NRP1~ colonic Treg cells (Rorc, Ccr6, Idua, Illrn, C2cd4b, Nxt1, 
Tmem176b, Cxcr3, Tnfrsfla, Adamts7, Pik3ip1, Rrad, Crmp1, Irak3, Fam129b, Ppcs, 
Tbxa2r, Avpil, Serpinbla, Alkbh7, Nckipsd, Haver2, 1123r, Txnip, Igj (also known as 
Ichain), Trim16, Pigp, Rras, Samd10, Il1r2, F2rl1, Maffand Ly6c1)"". 

Data availability. cDNA sequences of H. hepaticus-specific TCRs yielding data 
that support the findings of this study have been deposited in GenBank with the 
accession codes KY964547-KY 964570. RNA-seq data have been deposited in the 
Sequence Read Archive (SRA) with accession code SRP126932, and in the Gene 
Expression Omnibus (GEO) with accession code GSE108184. All other data are 
available from the corresponding author upon reasonable request. 
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Extended Data Figure 1 | Cloning and characterization of H. hepaticus- _ dz, In vitro activation of CFSE-labelled naive HH7-2tg and HH5-1tg cells by 
specific Ty17 TCRs, and generation of TCRtg mice and MHC class indicated stimuli in the presence of antigen-presenting cells. Data are from 
II tetramers. a, [L-23R-GFP expression in CD4* T cells from the large one of two independent experiments. e, f, Expansion of donor-derived 
intestines of mice with and without H. hepaticus colonization and after HH7-2tg (e) and HH5-1tg (f) (CD45.2) cells in the large intestine of 
IL-10RA blockade. Data are from one of five independent experiments. H. hepaticus-colonized or -free CD45.1 mice, gated on total CD4* T cells. 
b, Experimental scheme for cloning H. hepaticus-induced single Data are from one of three independent experiments. g, HH-E2 tetramer 
IL-23R-GFP* (predominantly Ty17) cell TCRs under IL-10RA blockade. staining of CD4* T cells from the large intestine of H. hepaticus-colonized 
c, Summary of the 12 dominant H. hepaticus-induced Ty17 TCRs. or -free mice. Data are from one of six independent experiments. 
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Extended Data Figure 2 | Extended characterization of SFB- and 

H. hepaticus-specific T cells in distinct anatomical sites in bacteria- 
colonized wild-type mice. a, Representative flow cytometry plots of 
donor-derived HH7-2tg (CD45.1/45.2) and 7B8tg (CD45.2/45.2) T cells 
in indicated tissues of mice colonized with SFB and H. hepaticus, gated on 
total CD4* T cells (CD4*CD3*) (n= 15). b, Proportions of donor-derived 
HH7-2tg and 7B8tg T cells among total CD4* T cells in indicated tissues. 
Data in a and b are from 1 of 3 experiments, with a total of 15 mice in the 
3 experiments. c, Representative flow cytometry plots of RORyt, FOXP3, 
BCL6 and CXCRS expression in CD4* T cells from the host and from 
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HH/7-2tg and 7B8tg donors in different tissues (n = 15). d, Frequencies of 
Treg (FOXP3*), TH17 (FOXP3” RORYt*) and Try (BCL6*CXCR5*) cells 
among donor-derived HH7-2tg and 7B8tg cells in different tissues. Data 
are from 1 of 3 experiments, with a total of 15 mice in the 3 experiments. 
e, Representative flow cytometry plots of FOXP3, RORyt, GATA3 and ST2 
expression in CD4* T cells from the host (blue) and from HH7-2tg donors 
(red) in the LILP (n =5). f, Representative flow cytometry plots of FOXP3, 
ROR4t, GATA3 and ST2 expression in total CD4* (green) and HH-E2 
tetramer (red) T cells in the LILP (n =5). Spl, spleen. 
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Extended Data Figure 3 | Extended characterization of H. hepaticus- 
specific TCRtg cell differentiation. a, HH7-2tg;Ragl ~'~ mice do not 
develop Treg cells in the thymus. Representative flow cytometry plots of 
Treg (FOXP3*CD25*) frequency in indicated tissues of H. hepaticus-free 
HH7-2tg;RagI*/~ (n= 3) or HH7-2tg;Rag1~/~ (n=3) mice. b, c, HH7- 
2tg;Rag1~'~ and HH7-2tg;Rag1*!~ donor-derived T cells differentiated 
into equal frequencies of ROR t* Treg cells in the large intestine of 
wild-type mice. Equal numbers (2,000) of congenic isotype-labelled 
HH7-2tg;Rag1*!~ (CD45.1/45.1) and Rag1~'~ (CD45.1/45.2) naive 

T cells were co-transferred into H. hepaticus-colonized wild-type 

B6 mice. Cells from the LILP were analysed two weeks after transfer. 
Data summarize two independent experiments (n = 6). b, Representative 
flow cytometry plots of donor and recipient T cell frequency (left), and 
ROR yt and FOXP3 expression (right) (n = 6). c, Frequencies of Treg 


CXCR5-PE ROR t-APC CXCR5-PE 


T,,17: FOXP3- RORyt+ 
T,,: FOXP3- BCL6+ CXCR5+ 


(FOXP3*), Ty17 (FOXP3” RORYt*) and Tpy (BCL6*CXCRS5*) cells 
among HH7-2tg Rag]*!~ (n=6) and Rag1~/~ (n=6) donor-derived 

T cells. d, e, Approximately 2,000 naive HH5-1tg cells (CD45.1/45.2) were 
adoptively transferred into wild-type B6 mice (CD45.2/45.2) colonized 
with H. hepaticus. Cells from LILP and the caecal patch were analysed two 
weeks after transfer. d, Representative flow cytometry plots are shown 

for RORyt, FOXP3, BCL6 and CXCRS expression in donor-derived and 
recipient CD4* T cells in the indicated tissues. e, Frequencies of Treg 
(FOXP3*), Ty17 (FOXP3” ROR4t*) and Tpy (BCL6*CXCR5*) among 
HH5-1tg donor T cells (n =8). Data are a summary of eight mice from two 
independent experiments. All statistics were calculated by unpaired 
two-sided Welch’s t-test. Error bars denote mean + s.d. P values are 
indicated on the figure. 
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Extended Data Figure 4 | See next page for caption. 
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Extended Data Figure 4 | Differentiation of SFB- and H. hepaticus- 
specific T cells in 1110*/~ and I110~'~ mice. a—d, Equal numbers 
(10,000) of congenic isotype-labelled HH7-2tg (CD45.1/45.2) and 7B8tg 
(CD45.1/45.1) T cells were co-transferred into I110~/~ and II10*/~ mice 
(CD45.2/45.2) colonized with both H. hepaticus and SEB. Intestinal T cells 
were examined two weeks later. a, Representative flow cytometry plots 

of FOXP3, RORyt and T-bet expression in total and FOXP3~ host CD4+ 
T cells in the SILP and LILP of I/10*/~ (n= 10) and I/10~/~ (n=8) mice 
that received TCR transgenic T cell transplants. b, Frequencies of Treg 
(FOXP3*) and Ty17 (FOXP3” ROR}t*) cells among SILP 7B8tg donor- 
derived cells in 1110+/~ (n= 10) and I110~/~ (n=8) mice. Data for a and 

b are a summary of four independent experiments. c, Representative flow 
cytometry plots of IL-10, IL-17A and IFNy expression in transferred 7B8tg 
and HH7-2tg cells from LILP and SILP of 1110*/~ and I110~'~ mice after 
re-stimulation (n=5 or 6). d, Proportions of transferred 7B8tg and 
HH7-2tg cells in the SILP and LILP of 1/10*/~ and I110~/~ mice that 
express IL-10, IL-17A and IFN‘ after re-stimulation (1 =5 or 6). Data for 
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cand d are a summary of two independent experiments. e, 2,000 naive 
HH5-1tg cells (CD45.1/45.2) were adoptively transferred into I110+/~ 
and [110~'~ mice colonized with H. hepaticus. Cells from the LILP were 
analysed two weeks after transfer (1 = 5). Representative flow cytometry 
plots of RORyt and FOXP3 expression in HH5-1tg donor cells are shown 
(left), along with a compilation of frequencies of Teg (FOXP3*) and Ty17 
(FOXP3- ROR‘t*). f, g, RORyt and FOXP3 expression in total CD4* 

and HH-E2 tetramer T cells (f) and frequencies (above) and absolute 
numbers (below) of Tyeg (FOXP3*) and Ty17 (FOXP3” ROR}t*) cells 
among HH-E2 tetramer* T cells (g) in the LILP of I/10*/~ (day 25, n=8) 
and I110~'~ (day 12 n=8, day 25 n=7, day 42 n=8) mice colonized with 
H. hepaticus for indicated times. All statistics were calculated by unpaired 
two-sided Welch’s t-test. Error bars denote mean + s.d. P values are as 
follows: b, i= 0.062 and ii =0.063. e, i= 1.46 x 10-7 and ii=3.10 x 10-4. 
g, Top, i=7.82 x 10~4, ii=0.014, iii = 0.088, iv= 1.48 x 1074, 

v=1.47 x 10-3 and vi=0.016; bottom, i=3.85 x 10~°, ii=9.63 x 107”, 
iii=1.31 x 10°, iv=8.91 x 10-7, v=1.15 x 10° and vi=1.56 x 1077. 
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Extended Data Figure 5 | See next page for caption. 
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Extended Data Figure 5 | Differentiation of H. hepaticus-specific T cells 
in colitis models. a, b, Naive HH7-2tg T cells were adoptively transferred 
into H. hepaticus-colonized Rag1~/~ mice to induce colitis (n =7). Data 
summarize two independent experiments. Representative expression of 
FOXP3, RORt, and T-bet (a), and a compilation of frequencies of Treg 
(FOXP3*), Ty17 (FOXP3” RORt*), Ty1 (FOXP3~ T-bet*) and Ty17/ 
Tul (FOXP3” ROR4t*T-bet*) cells in HH7-2tg donor-derived cells in the 
LILP of recipient mice was analysed 4 weeks after transfer. c-e, Analysis of 
H. hepaticus-specific T cell differentiation during C. rodentium-induced 
colonic inflammation. Data summarize two independent experiments. 

c, Schematic of experimental design. d, e, Representative flow cytometry 
plots of FOXP3, RORyt and T-bet expression in total CD4* and HH-E2 
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tetramer* T cells (d) and frequencies of T,eg (FOXP3*) and Ty17 
(FOXP3” ROR t*) cells among HH-E2 tetramer* T cells (e) in the LILP of 
C. rodentium-infected (n= 5) and -uninfected mice (n =5). f-h, Analysis 
of H. hepaticus-specific T cell differentiation during DSS-induced colitis. 
Data are a summary of two independent experiments. f, Schematic of 
experimental design. g, h, Representative flow cytometry plots of FOXP3, 
RORvt and T-bet expression in total CD4* and HH-E2 tetramer* T cells 
(g) and a compilation of frequencies of Treg (FOXP3*) and Ty17 
(FOXP3”- ROR t*) cells among HH-E2 tetramer* cells (h) in the LILP 

of DSS-treated (n = 10) and -untreated mice (n = 10). All statistics 

were calculated by unpaired two-sided Welch's t-test. Error bars denote 
mean + s.d. P values are indicated on the figure. 
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Extended Data Figure 6 | See next page for caption. 
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Extended Data Figure 6 | Extended characterization of Maf*"%, 
Rorc4™ and Gata34™% mice. a, Expression of c-MAF in the indicated 
CD4* T cell subsets in the LILP. b, Incomplete depletion of c-MAF protein 
in ROR}t* Treg cells in Maf*"® mice shown by a representative flow 
cytometry graph from 3 independent experiments (left), and a compilation 
of mean fluorescence intensities (MFI) in RORYt™ Treg cells and residual 
ROR tT Treg cells (right). c, Absolute numbers of indicated CD4* 

T cell populations in the LILP of indicated mice. Data are a summary of 

3 independent experiments for Rorc*"8 (n=7) and littermate controls 
(n=7) and 4 independent experiments for Maf*""8 (n= 11) and 
littermate controls (n = 8). d, e, Representative flow cytometry plots of 
FOXP3, RORyt and GATA3 expression in total and FOXP3+ CD4+ 

T cells (d) and a compilation of frequencies of RORyt* and RORYt™ Treg 
(FOXP3*) cells and Ty17 (FOXP3” ROR}t*) cells among total CD4* 

T cells (e) in the LILP of Gata34" (n = 8) and littermate controls (n=7). 
Data summarize two independent experiments. f, Absolute numbers of 
indicated HH-E2 tetramer* T cell populations in the LILP of indicated 
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mice. Data are a summary of 3 independent experiments for Rorc*™8 


(n=7) and littermate controls (mn =6) and 4 independent experiments for 
Maf*"8 (n= 11) and littermate controls (n = 8). g, Representative flow 
cytometry plots of Try markers BCL6 and CXCR5 among total CD4* 
and HH-E2 tetramer* cells from the caecal patch of Maf "8 mice and 
littermate controls (n = 4). h, i, SFB-specific T cells did not adopt the pro- 
inflammatory Ty17-Ty1 phenotype or expand in Maf*""8 mice. Data 
summarize two experiments, Mafsts (n=5) and littermate controls 
(n=6). Representative flow cytometry plots of FOXP3, RORyt 

and T-bet expression in total CD4* and SFB-tetramer™ T cells (h) and 
absolute number of SFB-tetramer” cells (i) in the SILP. All statistics 

were calculated by unpaired two-sided Welch's t-test. Error bars denote 
mean + s.d. P values are indicated on the figure or as follows: c, i=0.42, 
ii=0.73, iti = 6.38 x 10-°, iv=2.28 x 1074, v=7 x 1071), vi=7.10 x 10°%, 
vii = 2.99 x 10-* and viii =0.83. e, i= 0.081, ii=0.102 and iii=0.16. 

f, i=0.65, ii=0.41, iti =0.045, iv=0.12, v=0.29 and vi=6.28 x 107°. 
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Extended Data Figure 7 | See next page for caption. 
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Extended Data Figure 7 | Analysis of c-MAF function in RORyt* 

iTreg cells. a~d, Equal numbers of congenic isotype-labelled naive 
Maf*!*;Foxp3*" (ctrl, CD45.1/45.2) and Maf“"; Foxp3* (CD45.2/45.2) 
HH7-2tg cells were co-transferred into H. hepaticus-colonized wild-type 
CD45.1 mice. Cells from the LILP, mLNs and spleen were analysed 15 days 
after transfer. a, Schematic of experimental design. b, Flow cytometry plot 
depicting ratio of pooled co-transferred naive T cells before transfer. 

c, d, Left, ratios of Mafs™8 versus control HH7-2tg donor-derived 

cells in the mLNs and spleen (n = 10). Dashed line represents ratio of 
co-transferred cells before transfer. Right, frequencies of T,.g (FOXP3*) 
and Ty17 (FOXP3” RORYt*) cells among donor-derived cells (n = 10). 
Statistics were calculated by unpaired two-sided Welch’s t-test. Error bars 
denote mean + s.d. P values are indicated on the figure. e-h, Isolation of 
Maf-deficient and -sufficient iT;eg cells for RNA-seq through a 
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T cell reconstitution system. Two replicates represent two independent 
experiments. e, Schematic of experimental design. f, Flow cytometry plots 
indicating the sorting gates from two independent experiments. g, Flow 
cytometry plots showing FOXP3 and ROR expression in sorted FOXP3- 
YFP* cells from two independent experiments. h, Gene set enrichment 
analysis performed on RNA-seq dataset of c-MAF-sufficient versus 
-deficient iTyeg (FOXP3-YFP*) cells (1 = 2 independent experiments) 
with a gene set of 33 ROR t-dependent transcripts identified previously’. 
i, Top, representative flow cytometry plot of c-MAF expression in Ty17 
cells (FOXP3~ RORt*) from LILP of control (black) and Maf3™8 

(red) mice. The c-MAF negative population is defined by gating on 
FOXP3*ROR}t™ Treg cells from Maf*" mice (solid grey). Bottom, 
frequency of c-MAF expression in Ty17 cells in control (n = 6) and 
Maf*"8 (n= 9) mice from 3 independent experiments. 
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Extended Data Figure 8 | Transcriptional profiling of conventional 
Ty17 and H. hepaticus-specific T effector cells. a-f, RNA-seq was 
performed on 2 biological replicates of each indicated condition. a, Flow 
cytometry analysis of HH7-2tg Tor cells from H. hepaticus-colonized 
mice and conventional IL-23R-GFP* (predominantly SFB-specific Ty17) 
cells from SFB-colonized mice. GFP* gates in the bottom panel were 
used for sorting to perform RNA-seq. b, Principal component analysis 
of RNA-seq data from sorted cell populations. Coloured dots represent 
individual samples (n = 2). ¢, e, f, Differentially expressed genes were 
calculated in DESeq2 using the Wald test with Benjamini-Hochberg 
correction to determine the FDR. Genes were considered differentially 
expressed when FDR < 0.1 and log: fold change > 1.5. c, Venn diagram 
depicting differentially expressed genes between indicated comparisons. 
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d, Significantly enriched disease pathways in the set of 149 shared genes 
upregulated in HH7-2tg Maf*""8 and HH7-2tg from anti-IL-10RA- 
treated mice compared to conventional large intestine Tj17 cells. P values 
calculated by ingenuity pathway analysis using Fisher’s exact test. e, 
Comparison of transcriptomes of H. hepaticus-specific Ty17 cells from 
mice treated with IL-10RA blockade or Maf*""8 and conventional Ty17 
cells. Scatter plot depicting log, fold change of gene expression. Blue, red 
and purple dots indicate significant differences. f, Heat map depicting the 
347 shared genes differentially expressed between pathogenic HH7-2 and 
conventional Ty17 cells (purple dots in e). Data for each condition are 
the mean of 2 biological replicates. Scale bar represents z-scored variance 
stabilized data (VSD) counts. 
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Extended Data Figure 9 | STAT3 and TGF6 signal synergistically to 
promote c-MAF expression. a, Top, representative flow cytometry plots 
depicting RORyt and FOXP3 expression in CD4* T cells in the LILP 

of mice with a dominant-negative mutant of TGF receptors (CD4- 
dnTGFbRII) and littermate controls (n = 3). Bottom, representative plot 
of c-MAF expression in FOXP3* cells from the indicated mice in the 
top panel. b, Top, representative flow cytometry plots depicting RORyt 
and FOXP3 expression in CD4* T cells in the LILP of Stat3l!",Cd4¢ and 
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Stat3/" littermate controls (n= 4). Bottom, representative plot of c- MAF 
staining in FOXP3* cells from indicated mice in the top panel. c, Mean 
fluorescence intensity of c-MAF staining in in vitro differentiated CD4* 
T cells. Naive CD4* T cells from wild-type, Stat3™".Cd4" and 
Mafl"';cda’ mice were activated for 48 h with anti-CD3¢/anti-CD28 
antibodies under indicated conditions. Dashed line represents the MFI 
of c-MAF in Maf';cda’ T cells. Data are from one of two independent 
experiments. 
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b, Activity of nTyeg cells in the transfer-mediated colitis model. Percentage 
weight change (left) and colitis histology scores (right) of Ragl ~!— mice 
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adoptively transferred with naive T cells alone (n = 8), or naive T cells 

in combination with nT,.g cells from Maf*"*s (n= 10) or littermate 
control Foxp3°*!? (n =9) mice. Data are a summary of two independent 
experiments. All statistics were calculated by unpaired two-sided Welch's 
t-test. Error bars denote mean + s.d. P values are indicated on the figure. 
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Asparagine bioavailability governs metastasis in a 


model of breast cancer 


Simon R. V. Knott!*3*, Elvin Wagenblast**+>*, Showkhin Khan*®, Sun Y. Kim?, Mar Soto”, Michel Wagner’, 
Marc-Olivier Turgeon’, Lisa Fish®?, Nicolas Erard!, Annika L. Gable?, Ashley R. Maceli*, Steffen Dickopf?, 
Evangelia K. Papachristou!, Clive S. D’Santos!, Lisa A. Carey", John E. Wilkinson”, J. Chuck Harrell!’, Charles M. Perou", 


Hani Goodarzi®’", George Poulogiannis’ & Gregory J. Hannon 


Using a functional model of breast cancer heterogeneity, we 
previously showed that clonal sub-populations proficient at 
generating circulating tumour cells were not all equally capable of 
forming metastases at secondary sites!. A combination of differential 
expression and focused in vitro and in vivo RNA interference screens 
revealed candidate drivers of metastasis that discriminated metastatic 
clones. Among these, asparagine synthetase expression in a patient’s 
primary tumour was most strongly correlated with later metastatic 
relapse. Here we show that asparagine bioavailability strongly 
influences metastatic potential. Limiting asparagine by knockdown 
of asparagine synthetase, treatment with L-asparaginase, or dietary 
asparagine restriction reduces metastasis without affecting growth of 
the primary tumour, whereas increased dietary asparagine or enforced 
asparagine synthetase expression promotes metastatic progression. 
Altering asparagine availability in vitro strongly influences invasive 
potential, which is correlated with an effect on proteins that promote 
the epithelial-to-mesenchymal transition. This provides at least one 
potential mechanism for how the bioavailability of a single amino acid 
could regulate metastatic progression. 

Most women with breast cancer do not succumb to their primary 
tumour but instead to metastases that become apparent after the 
primary lesion has been removed. For cells to contribute to metastases, 
they must leave the primary site, enter the vasculature, survive in the 
blood, and then extravasate and colonize secondary sites. Our previous 
studies of a mouse model of breast tumour heterogeneity identified 
two clonal 4T1 sub-lines with a strong propensity to form circulating 
tumour cells (CTCs) through a non-invasive mechanism requiring 
vascular mimicry (4T1-E and -T)!”. These two clones differed in their 
ability to form metastases, with 4T1-T preferentially colonizing brain, 
liver, and lungs. The distinction between the metastatic potential of the 
two CTC-forming clones offered the opportunity to identify drivers 
of metastasis, which exert their effects late in metastatic progression. 

To validate the observation that 4T1-T had greater metastatic poten- 
tial among CTC-proficient clones, we combined equal numbers of 
4T1-Eand -T cells and introduced these directly into the bloodstream 
of immune-compromised recipients (NOD-SCID-II2rg~/~ (NSG) 
mice). Although the two clones were initially present in equal abun- 
dance (Fig. 1a), when cells were collected from the lung at day 7, clone T 
predominated, with its relative representation being inversely correlated 
with the total number of cells injected. 

We identified 192 genes with higher expression in 4T1-T than 4T1-E 
cells (Supplementary Table 1). Their corresponding Gene Ontology 


1,2,6 


terms were enriched for processes important for metastatic spread 
(Supplementary Table 2; for example, cell migration and locomotion). 
A retrospective analysis of patient data showed that genes within the 
set are more highly expressed in aggressive breast tumour subtypes* 
(Extended Data Fig. la). They were also more highly expressed in 
the primary tumours of patients with later relapse to the bone, brain, 
and lungs compared with primary tumours of relapse-free survivors 
(Extended Data Fig. 1b for lung). 

To identify metastatic drivers, we performed an RNA interference 
(RNAi) screen, with two arms (Fig. 1b). In total, 26 pools of approx- 
imately 40 short hairpin RNAs (shRNAs), targeting protein-coding 
members of the 192-gene set, were introduced into 4T1-T cells*. These 
were collected onto Matrigel or introduced into NSG mice by tail vein 
injection. After 24h, the cells that had invaded through the Matrigel 
were collected and, after 7 days, lungs were collected from the mice. 
Using high-throughput sequencing, we identified shRNAs that were 
depleted from the invaded cell populations or lung metastases, pre- 
sumably because they targeted genes important for these processes. 
Strong overlap was observed when the in vitro and in vivo candidates 
were compared (Fig. 1c and Supplementary Table 3). 

Of the 11 candidate genes that scored in both the in vitro and 
in vivo assays, asparagine synthetase (Asns) had the most robust clinical 
evidence supporting its relevance to disease progression (Extended 
Data Fig. 1c) Expression levels of the human orthologue, ASNS, 
were predictive of general and lung-specific relapse in two datasets of 
patients with breast cancer. Also, when a small collection of matched 
tumour and lung metastases were transcriptionally profiled, ASNS was 
found to be more highly expressed in secondary lesions. ASNS is more 
highly expressed in aggressive tumour subtypes (Extended Data Fig. 1d) 
and it is more highly expressed in patients with relapse to the lymph 
node, brain, liver, and lungs compared with relapse-free survivors 
(Extended Data Fig. le). Subsequent analyses identified ASNS as 
predictive of survival in three additional datasets of patients with 
breast cancer (Extended Data Fig. 1f). In addition to breast tumours, 
ASNS is negatively correlated with survival in four out of the ten other 
solid tumours in The Cancer Genome Atlas (TCGA) Pan-Cancer 
dataset (Extended Data Fig. 1g) and is a globally predictive biomarker 
for solid tumours (Extended Data Fig. 1h). Previous research has shown 
that asparagine is an important regulator of cellular metabolism and 
adaptation>®. 

To validate Asns as a metastatic driver, we infected 4T1-T cells 
with two shRNAs targeting Asns or a control and introduced these 
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Figure 1 | Identification of metastatic drivers. a, Relative proportions 
of 4T1-E and -T cells extracted from the lungs of NSG mice, into which 
mixtures of cells were introduced via tail vein at different concentrations. 
Each bar represents a sample or independent mouse. b, RNAi screening 
scheme to identify drivers of invasion in vitro and extravasation and 
colonization in vivo (n=5 mice or n=2 Matrigel six-well invasion 
chambers per approximately 50-construct shRNA pool, gene-level hit calls 
with empirical Bayes-moderated t-test false discovery rate < 0.05 and 0.1 
for in vivo and in vitro screens, respectively). c, Overlap between genes 
identified in each arm of the RNAi screen depicted in b (hypergeometric 
test P< 0.01). 


cells intravenously into NSG mice (Supplementary Table 4). Asns- 
silenced cells produced significantly fewer lung metastases (Fig. 2a and 
Extended Data Fig. 2a). Asns-silenced cells also showed poor invasion 
into Matrigel (Fig. 2b and Extended Data Fig. 2b). Silencing Asns did 
affect proliferation in vitro; however, this defect was minor compared 
with that observed in the invasion assay (Extended Data Fig. 2c, d). 
Intracellular free asparagine was reduced by silencing of Asns in 4T1- 
T cells (Extended Data Fig. 2e), and the abilities to invade and 
proliferate were increased in Asns-silenced cells when the culture 
medium was supplemented with asparagine (Extended Data Fig. 2f, g). 

When Asns-silenced cells were injected into the mammary fat pad, no 
significant change in primary tumour formation was observed (Fig. 2c), 
yet CTCs and lung metastases were reduced (Fig. 2d, e). Although 
statistically insignificant, metastases initiated by silenced cells were 
noticeably smaller, hinting at a growth defect at the metastatic site 
(Fig. 2f). Similar results were obtained with Asns-silenced parental 4T1 
cells, indicating that Asns dependency is not a peculiarity of a single 
clonal line (Extended Data Fig. 3a, b). Enforced Asns expression in 
parental 4T1 populations did not affect primary tumour growth but 
did increase metastases both in number and in size (Extended Data 
Fig. 3c-e). Similar outcomes were observed upon enforced ASNS 
expression in human MDA-MB-231 breast cancer cells (Extended 
Data Fig. 3f-i). 

To determine whether the observed effects were unique to aspar- 
agine, we supplemented the medium separately with other non- 
essential amino acids (NEAAs) lacking in the culture medium, or with 
glycine, which is present and acted as a negative control, and assayed 
cells for invasiveness. 4T1 cells responded uniquely to asparagine sup- 
plementation, with an approximately twofold increase in invasiveness 
(Fig. 3a), although levels of uptake were similar for each of the amino 
acids with the exception of aspartic and glutamic acids (Extended Data 
Fig. 4a). More profound effects were observed with MDA-MB-231 cells 
(Extended Data Fig. 4b, c). Growth was not affected by asparagine sup- 
plementation for either cell line during the same period (Extended 
Data Fig. 4d, e). 
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Figure 2 | Validation of Asns as a driver of invasion and metastasis. 

a, Quantification of metastases in the lungs of mice intravenously injected 
with Asns-silenced or -expressing 4T1-T cells (n= 10 mice per cell line, 
edges of the box are the 25th and 75th percentiles, and error bars extend to 
the values q3 + w(q3 — qi) and q; — w(q3 — q1), in which w is 1.5 and q; and 
qs are the 25th and 75th percentiles, which is also true for d-f, rank-sum 
test P< 0.001). b, Representative images of collection wells of the Matrigel 
assay after Asns-silenced and -expressing cells were applied 24h previously 
(n=3 invasion chambers per cell line). c, Tumour volumes resulting from 
the orthotopic injection of the cells described in a (n= 10 mice per cell line). 
d, Relative abundance of CTCs in animals corresponding to the tumours 
described in c (n=6 mice per cell line, rank-sum P< 0.05 for shAsns-2). 

e, Quantification of metastases in H&E-stained lung sections, from mice 
described in d (rank-sum P < 0.0002). f, Average diameters of the metastases 
of each mouse described in e. See Source Data. 


Because invasiveness could be modulated either by altering aspar- 
agine biosynthetic capacity or by modifying extracellular pools, we 
investigated whether metastasis could be influenced by treatment 
with L-asparaginase. This enzyme is used to treat acute lymphoblastic 
leukaemia, which is generally highly dependent on extracellular 
asparagine’®. L-Asparaginase has proved ineffective for treating 
solid tumours, consistent with Asns silencing not affecting growth 
at the primary site’. NSG mice harbouring orthotopic 4T1 tumours 
were treated with 60 U L-asparaginase 5 times per week for 19 days, 
reducing serum asparagine to undetectable levels (Supplementary 
Table 5). Although no significant difference was detected in primary 
tumours, compared with controls, a reduction in metastasis was 
observed (Extended Data Fig. 5a-c). 

In TEL-AML1-negative patients with acute lymphoblastic leukaemia, 
resistance to L-asparaginase is thought to be achieved through 
increased biosynthetic production of asparagine'’. When Asns-silenced 
cells were injected orthotopically into mice treated with L-asparaginase, 
metastases were nearly undetectable (Fig. 3b and Extended Data 
Fig. 5d). In this case, a reduction of primary tumour volume was also 
observed (Extended Data Fig. 5e). Similar results were obtained with 
ASNS-silenced MDA-MB-231 cells (Extended Data Fig. 5f, g). 

The availability of extracellular asparagine can also be manipu- 
lated by altering asparagine levels in the diet. shRNA-infected 
4T1-T cells were orthotopically injected into mice that received a 
control, low-asparagine, or high-asparagine chow (0.6%, 0%, and 
4%, respectively). High-performance liquid chromatography (HPLC) 
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Figure 3 | Extracellular asparagine availability drives invasion and 
metastasis. a, Quantification of parental 4T1 cell invasion rates, as measured 
by the Matrigel invasion assay, in culture medium supplemented with the 
indicated NEAAs (n= 5 invasion chambers, rank-sum P< 0.01). 

b, Representative H&E-stained lung sections from animals injected with 
Asns-silenced or -expressing 4T1-T cells. Animals were administered 
L-asparaginase or PBS (n = 10 mice per condition). c, Quantification of lung 
metastases in animals injected with Asns-silenced or -expressing 4T1-T cells. 
Animals were administered a diet with either 0%, 0.6%, or 4% asparagine 
content for the duration of the experiment (n= 10 mice per condition, rank- 
sum P <0.05 for Asns-silenced versus -expressing cells across all diets, for 
each cell line with 0% versus 4% diets, for shRenilla and shAsns-1 infected 
cells with 0% versus 0.6% diet, and for unsilenced cells with 0.6% versus 

4% diet). d, Mass-spectrometric quantification of the asparagine levels in 
the mammary gland, blood serum, and lungs of animals administered 
L-asparaginase or PBS (relative abundance normalized by total metabolite 
peak area, n > 8 tissue sections per condition, rank-sum P< 0.005 for PBS 
versus L-asparaginase across all tissues, rank-sum P < 0.05 for mammary 
gland versus lung, and rank-sum P < 0.0005 for serum versus lung and 
serum versus mammary gland). See Source Data. 


confirmed that serum asparagine levels were significantly altered in 
concordance with dietary intake (Extended Data Fig. 6a). Asparagine 
restriction did not affect primary tumour growth, regardless of Asns 
expression status (Extended Data Fig. 6b). By contrast, metastatic 
burden was decreased in animals that were fed low-asparagine diets 
and increased in animals given high-asparagine diets (Fig. 3c and 
Extended Data Fig. 6c). Metastases were nearly undetectable in mice 
that were injected with Asns-silenced cells and fed a low-asparagine 
diet. Similar results were obtained when parental 4T1 cells were ortho- 
topically injected into animals fed these same asparagine-controlled 
diets (Extended Data Fig. 6d-f). 

Analyses of the mammary gland, serum, and lungs of mice by mass 
spectrometry suggested that, under normal physiological conditions, 
asparagine levels are highest in mammary gland and lowest in serum 
(Fig. 3d). High asparagine availability in the mammary gland might 
blunt the effect of Asns silencing or changes in global asparagine lev- 
els on primary tumour growth, whereas low levels in the serum may 
make CTCs susceptible to these manipulations. Overall, asparagine 
abundance in tissues correlated with Asns expression (Extended Data 
Fig. 6g). ASNS expression levels follow a similar pattern across human 
tissues (Extended Data Fig. 6h), raising the possibility of similar effects 
if asparagine levels were altered in patients!!. 

To understand the mechanism by which asparagine availability 
might affect invasion and metastasis, we examined expression changes 
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Figure 4 | Asparagine availability regulates EMT. a, Relative protein 
abundances of EMT-up proteins in Asns-silenced and -expressing 4T1- 
T cells (n=3 replicates per cell line, edges of the box are the 25th and 
75th percentiles, and error bars extend to the values q3 + w(q3 — qi) and 
qi — w(q3 — qi), in which w is 1.5 and q, and q3 are the 25th and 75th 
percentiles, which is also true for b-d, rank-sum P < 0.01). b, Relative 
protein abundances of EMT-up proteins in 4T1-T cells when grown in 
normal and asparagine-supplemented medium (n = 3 replicates per cell 
line, rank-sum P < 0.05). ¢, Relative expression levels of EMT-up and 
-down genes in lung metastases versus primary tumours derived from 
orthotopic injection of 4T1-T cells (n=4 mice, rank-sum P< 5.0 x 10-? 
for EMT-up genes). d, Relative expression levels of Twist1, Cdh1, and 
EMT-up and -down genes in cells isolated from tumours and lungs derived 
from Asns-silenced versus -expressing 4T1-T cells (n=4 replicates per 
condition, rank-sum P< 5.0 x 10~'! for EMT-down genes in the tumour 
and rank-sum P< 5.0 x 10-8 for EMT-up genes in the lung; Cdh1 

was differentially expressed in the tumours and lungs, and Twist1 was 
differentially expressed in tumours; DESeq false discovery rate < 0.05). 
e, Representative images of immunohistochemistry (IHC) stainings for 
Twistl and Cdh1 in orthotopic tumours derived from Asns-silenced and 
-expressing 4T1-T cells. See Source Data. 


induced by Asns silencing, at both the RNA and protein levels. RNA 
measurements were the strongest predictor of protein-level changes 
(Extended Data Fig. 7a). Consistent with a previous report of transla- 
tional pausing at asparagine residues in L-asparaginase-treated cells, 
we also found asparagine content to be predictive of corresponding 
protein-level changes!” (Extended Data Fig. 7a), irrespective of whether 
they were normalized for RNA levels (Extended Data Fig. 7b). 
Among the asparagine-enriched proteins that were depleted on Asns 
silencing, we found genes whose human orthologues were upregu- 
lated after induction of the epithelial-to-mesenchymal transition 
(EMT)}3 (Fig. 4a, Extended Data Fig. 7c and Supplementary Table 6; 
EMT-up proteins). Overall, depleted proteins had 10% higher aspara- 
gine content than the analysed proteome as a whole, whereas EMT-up 
proteins had 19% higher asparagine content (Extended Data Fig. 7c). 
Human EMT-up proteins were also asparagine-enriched (Extended 
Data Fig. 7c). A re-analysis of existing ribosomal profiling data 
revealed high rates of pausing at asparagine residues within EMT-up 
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genes in L-asparaginase-treated human prostate cancer cells (Extended 
Data Fig. 7d, e), and these same proteins increased in expression 
when 4T1-T cells were cultured in elevated asparagine’? (Fig. 4b). 
Asparagine enrichment is a globally conserved property of EMT-up 
proteins, with enrichment being greatest in mammals (Extended 
Data Fig. 8). 

EMT-up genes were also downregulated at the transcriptional level 
in Asns-silenced cells, and they were increased in their RNA levels 
when these cells were grown in asparagine-supplemented medium 
(Extended Data Fig. 9a, b). A re-analysis of existing data also showed 
reduced expression of EMT-up genes when ATF4, which regulates 
ASNS transcription, was deleted in haploid cells (Extended Data 
Fig. 9c), and liver cells from L-asparaginase-treated ATF4-knockout 
mice were more perturbed in their EMT program than were simi- 
larly treated wild-type mice (Extended Data Fig. 9d)!*!°. Considered 
together, these data suggest that asparagine bioavailability might affect 
metastasis, at least in part, through regulation of EMT. 

To examine the role of EMT in metastasis in our model, we orthotopi- 
cally injected 4T1-T cells in which we had silenced expression of trans- 
forming growth factor-3 (Tgf-8, also known as Tgfb1), a key driver of 
EMT'"* (Supplementary Table 4). Primary tumour growth was unaffected 
by this manipulation (Extended Data Fig. 9e); however, the expression 
of two prototypical EMT markers (Twist1 and E-cadherin) were altered 
to indicate a perturbed EMT program (Extended Data Fig. 9f, g). Tgf-6- 
silenced cells produced fewer metastases from the primary tumour or 
when intravenously injected (Extended Data Fig. 9h, i). 

Although no differences were detected in haematoxylin and eosin 
(H&E)-stained tumour sections, morphological distinctions were 
noticeable when shRNA-infected 4T1-T cells were isolated from 
primary tumours by 6-thioguanine selection, with most Asns- 
silenced cells displaying an epithelial morphology (Extended Data 
Fig. 10a, b). Most 6-thioguanine-isolated metastatic cells displayed 
a mesenchymal morphology, regardless of Asns expression status, 
and this was matched by an increase in the expression of EMT-up 
genes (Fig. 4c). Nevertheless, EMT-up genes were downregu- 
lated in Asns-silenced versus -expressing metastatic cells, indicat- 
ing an increased representation of epithelial cells in the silenced 
populations. Similarly, EMT-down genes were upregulated, and 
Twist and E-cadherin expression measurements indicated a higher 
epithelial representation in Asns-silenced primary tumour cell popu- 
lations (Fig. 4d). These results were validated by qPCR for Twist and 
E-cadherin in Asns-silenced and -expressing cells that were isolated 
by fluorescence-activated cell sorting from primary and secondary 
lesions (Extended Data Fig. 10c, d). 

Staining for Twist and E-cadherin proteins confirmed that EMT 
is perturbed in Asns-silenced tumours, and this same pattern was 
observed in the corresponding metastases (Fig. 4e and Extended Data 
Fig. 1la—c). Similar patterns were observed in the primary tumours 
of mice that had been treated with L-asparaginase (Extended Data 
Fig. 11d, e) or subjected to dietary asparagine restriction (Extended 
Data Fig. 11f g). 

Our model of breast tumour heterogeneity has strongly implicated 
asparagine bioavailability as a regulator of metastatic progression. This 
also seems to be relevant in human cancers, as high ASNS expression 
is a marker of poor prognosis for many tumour types. One mecha- 
nism underlying our findings is a probable link between asparagine 
bioavailability and EMT, which can be observed in vitro and in vivo. 
In our breast cancer model, the gating step probably occurs in the cir- 
culation, where asparagine levels are low and are strongly affected by 
either L-asparaginase treatment or dietary restriction. Nonetheless, we 
do see effects on ratios of epithelial- to mesenchymal-like tumour cells 
at the primary and secondary sites, which could also affect both tumour 
progression and response to therapy. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Cell culture. The mouse mammary tumour cell line 4T1 (American Type Culture 
Collection, ATCC) and any derived clonal cell line were cultured in DMEM high 
glucose supplemented with 5% fetal bovine serum, 5% fetal calf serum, MEM 
NEAA, and penicillin—-streptomycin (Thermo Fisher Scientific). The human breast 
tumour cell line MDA-MB-231 (ATCC) was cultured in DMEM high glucose 
supplemented with 10% fetal bovine serum, NEAA and penicillin-streptomycin 
(Thermo Fisher Scientific). The 4T1 and MDA-MB-231 cell lines were tested and 
authenticated by ATCC. The Platinum-A (Cell BioLabs) and 293FT (Thermo Fisher 
Scientific) packaging cell lines for virus production were cultured in DMEM high 
glucose supplemented with 10% fetal bovine serum and penicillin-streptomycin. 
All cell lines were routinely tested for mycoplasma contamination. 

Virus production. Retroviral vectors were packaged using the Platinum-A (Cell 
BioLabs) cell line and lentiviral vectors were packaged using the 293FT cell line 
(Thermo Fisher Scientific) as previously described’. 

Animal studies. All mouse experiments were approved by the Cold Spring 
Harbour Animal Care and Use Committee. The maximal permitted tumour size of 
20mm in any direction was never exceeded. All mouse injections were performed 
with 6- to 8-week-old female NOD-SCID-Il2rg-/~ (NSG) mice (JAX). Balb/c mice 
were not used in this study as the different clonal cell lines have variable GFP 
levels due to the lentiviral barcode vector. Tail vein injections were performed 
using 5 x 10° mouse mammary tumour cells, which were re-suspended in 100,11 
of PBS and injected via the tail vein. Orthotopic injections were performed 
using 1 x 10° mouse mammary tumour cells or 5 x 10° MDA-MB-231 cells. 
For this, cells were re-suspended in a 1:1 mix of PBS and growth-factor-reduced 
Matrigel (BD Biosciences). A volume of 201] was injected into mammary gland 
number 4 for mouse mammary tumour cells and a volume of 40 1l was injected 
for MDA-MB-231 cells. Primary tumour volume was measured using the 
formula V=4(L x W?), in which L is length and W is width of the primary 
tumour. For L-asparaginase studies, mice were administered 20011 of 60 U 
L-asparaginase (Abcam) five times per week through intraperitoneal injections. 
For L-asparagine-adjusted diets, mice were given a control amino-acid diet (0.6% 
asparagine), an asparagine-deficient diet (0% asparagine), or an asparagine-rich 
diet (4% asparagine). All diets were isonitrogenous and contained similar calorie 
densities. Sample size was chosen to give sufficient power for calling significance 
with standard statistical tests. Tumour and metastasis experiments were performed 
with ten animals per condition to account for the variability observed in such 
in vivo experiments. Animals were assigned to treatment groups through random 
cage selection. Mice were excluded from the analysis if the primary tumour 
engrafted into the peritoneum. All image quantifications were performed in a 
blinded manner. Source data are available for tumour volume measurements and 
lung metastases. 

Barcode analysis. The barcodes of the 4T1-E and 4T1-T cells were amplified and 
sequenced as previously described!. 

In vivo shRNA lung screen and in vitro invasion screen. shRNAs were predicted 
on the basis of the Sherwood algorithm described in ref. 4. Pools of approximately 
50 shRNAs were packaged in Platinum-A cells. For each pool, 10 million 4T1- 
T cells were infected at a multiplicity of infection of 0.3. The infected cells were 
selected with 500,.g ml! hygromycin for 5 days and each pool was injected into 
five mice each via the tail vein. Two pre-injection pools were collected at the time 
of injection. After 7 days, mice were euthanized and perfused with PBS to remove 
blood and non-extravasated cells from the lungs. Lungs were collected and genomic 
DNA was isolated using phenol chloroform extraction. Genomic DNA of the pre- 
injection pools was isolated using a QIAamp DNA Blood Mini Kit (Qiagen). 

The in vitro invasion assays were performed in parallel. Each pool was plated on 
two six-well BioCoat Matrigel invasion plates (Corning). Six hundred thousand cells 
were plated on top of each well in cell culture medium without serum. Cells were 
allowed to invade through the Matrigel into medium containing 5% fetal bovine 
serum and 5% fetal calf serum for 24h. Invaded cells were scraped off, washed 
with PBS, and genomic DNA was isolated using a QlLAamp DNA Blood Mini Kit. 

The shRNAs were amplified using a two-step PCR protocol previously described 
in ref. 4 for next-generation sequencing. 

First PCR forward primer 1: 5-CAG AAT CGT TGC CTG CAC ATC TTG GAA 
AC-3 and reverse primer 1: 5-CTG CTA AAG CGC ATG CTC CAG ACT GC-3. 

Second PCR forward primer 2: 5-AAT GAT ACG GCG ACC ACC GAG ATC 
TAC ACT AGC CTG CGC ACG TAG TGA AGC CAC AGA TGT A-3 and reverse 
primer 2: 5-CAA GCA GAA GAC GGC ATA CGA GAT NNN NNN GTG ACT 
GGA GTT CAG ACG TGT GCT CTT CCG ATC TCT GCT AAA GCG CAT 
GCT CCA GAC TGC-3. The reverse primer contained a barcode (NNNNNN) 
that enabled multiplexing. 

Analysis of screening data. shRNA quantification was performed as previously 
described*. For each pool, shRNA quantities in each sample were normalized by 
their sum and log-fold changes were calculated using the initially infected cell 
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population as a reference. Fold change values for each pool were then consolidated 
into a single table and an empirical Bayes-moderated t-test was applied to identify 
depleted molecules. Genes with two or more depleted shRNAs were considered 
hits in each screen. 
Gene Ontology enrichment analysis. Gene Ontology enrichment analysis used 
the GOrilla web portal’’. The Refseq identifiers of genes identified as overexpressed 
in 4T1-T cells, compared with 4T1-E cells, were used as foreground and the entire 
Refseq gene list was used as background. 
Expression subtype and relapse analysis. For all clinical data analysis, we used the 
University of North Carolina ‘855 patient set, which is available as published data at 
https://genome.unc.edu. All data were derived from an initial matrix that was arranged 
with patients on the horizontal axis and genes on the vertical axis. Initial normalization 
involved quantile normalization to ensure that the global expression profile of each 
patient was similar. Following this, each gene was z-score normalized across patients. 
For Extended Data Fig. 1a, the average expression level of each gene was calculated 
for each patient subtype. For Extended Data Fig. 1b, the average expression level of 
each gene was calculated for each gene for patients with and without relapse to each 
secondary site. For Extended Data Fig. 1d, the level of ASNS in each patient, stratified 
by subtype, is plotted. For Extended Data Fig. le, each boxplot represents the level of 
ASNS in each patient with and without relapse to each secondary site. 
Individual in vitro invasion assay. The in vitro invasive capacity of cells was meas- 
ured using six-well BioCoat Matrigel invasion plates. For parental 4T1 cells, 1 x 10° 
cells were plated on individual wells; for 4T1-T cells, 8 x 10° cells were plated on 
individual wells; and for MDA-MB-231 cells, 5 x 10° cells were used per indivi- 
dual well. Cells were re-suspended in medium without serum or NEAA and cells 
invaded into medium with 5% fetal bovine serum and 5% fetal calf serum. For 
Fig. 3a and Extended Data Fig. 4b, 4T1 and MDA-MB-231 cells were cultured in 
medium containing 100x concentration of the specified amino acid (relative to 
the concentration achieved by supplementing the medium with 1x NEAA) for 2 
or 3 days, respectively, before starting the invasion assay. After 24h, non-invaded 
cells were removed and the invasion wells were washed in PBS, fixed in 2% glu- 
taraldehyde for 2 min, and stained with 0.5% crystal violet for 10 min. The wells 
were washed in distilled H2O, air-dried, and scanned using an Odyssey infrared 
scanner. The signal was quantified using ImageJ (NIH). 
Competition and proliferation assay. For Extended Data Fig. 2c, d, cells 
were grown in medium lacking NEAA. For the mCherry competition assay, 
shRNA-transduced mCherry-positive cells were admixed with untransduced 
cells. mCherry fluorescence was quantified on an LSR II flow cytometer (BD 
Biosciences). The proliferation assay was performed using a CellTrace Violet Cell 
Proliferation Kit (Thermo Fisher Scientific). For Extended Data Fig. 4d, e, 4T1 
and MDA-MB-231 cells were cultured in medium containing 100x concentra- 
tion of the specified amino acid (relative to the concentration that is achieved by 
supplementing the medium with 1 x NEAA) for 2 or 3 days, respectively, before 
starting the proliferation assay. Cells were stained with CellTrace violet and then 
trypsinized and re-suspended in the medium. After 24h, cells were collected to 
quantify violet fluorescence intensity using an SH800 flow cytometer (Sony). 
Isolation of tumour and lung metastatic cells. Tumour and lung tissues were 
collected, minced, and digested into single cells as previously reported’. Cells 
were either grown in 4T1 cell culture medium containing 601M 6-thioguanine to 
deplete stromal cells or directly sorted on the basis of mCherry expression using 
the FACSAria III cell sorter (BD Biosciences). 
RNA sequencing library preparation. RNA sequencing libraries from cultured 
4T1-T cells were prepared as previously described!. Each sample was sequenced on 
an IIJumina HiSeq sequencer generating 76-nucleotide single-end reads. 
RNA sequencing analysis. Illumina sequencing reads were aligned to the mouse 
genome (mm10) using Bowtie2 with default parameters’*. Genes were assigned 
a count using HTseq-count, and differential expression analysis was performed 
using DESeq)?””. 
shRNA knockdown and cDNA overexpression studies. Mouse and human cell 
lines were transduced with shRNA expressing retroviral or lentiviral constructs, 
respectively. After infection, 4T1-T cells were selected with 500,.g ml’ hygromycin 
for 5 days and MDA-MB-231 cells were selected with 21g ml! puromycin for 
4 days. Cell lines infected with cDNA overexpressing retroviral constructs were 
selected with G418 for 1 week. The parental 4T1 cell line was selected with 
600 pg ml! G418 and MDA-MB-231 cells were selected with 1,500 1g ml“! G418. 
cDNA overexpression genes. Mouse Asns: NM_012055.1. Human ASNS: 
NM_001673.2. 
shRNA knockdown sequences. Mouse shAsns- 1: T@CTGTTGACAGTGAGCGCC 
ACTGCCAATAAGAAAGTATATAGTGAAGCCACAGATGTATATACTTTCT 
TATTGGCAGTGTTGCCTACTGCCTCGGA. 

Mouse shAsns-2: TGCTGTTGACAGTGAGCGCCACTATGAAGTTTTGGA 
TTTATAGTGAAGCCACAGATGTATAAATCCAAAACTTCATAGTGTTGCC 
TACTGCCTCGGA. 
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Mouse shTgfb1-1: TGCTGTTGACAGTGAGCGCCAGTATATA 
TATGTTCTTCAATAGTGAAGCCACAGATGTATTGAAGAACATATATATAC 
TGTTGCCTACTGCCTCGGA. 

Mouse shTgfb1-2: T@CTGTTGACAGTGAGCGAAGTATATATATGTTCTT 
CAAATAGTGAAGCCACAGATGTATTTGAAGAACATATATATACTGTGCC 
TACTGCCTCGGA. 

Human shASNS-1: TGCTGTTGACAGTGAGCGCCAGAAGCTAAAGGTC 
TTGTTATAGTGAAGCCACAGATGTATAACAAGACCTTTAGCTTCTGATGC 
CTACTGCCTCGGA. 

Human shASNS-2: TGCTGTTGACAGTGAGCGCAGCAATGACAGAAGA 
TGGATATAGTGAAGCCACAGATGTATATCCATCTTCTGTCATTGCTTTG 
CCTACTGCCTCGGA 
Quantitative RT-PCR. Total RNA from cells was purified and DNase treated 
using an RNeasy Mini Kit (Qiagen). For whole tissues, RNA was isolated using a 
TRIzol Plus RNA Purification Kit (Thermo Fisher Scientific). The tissue lysate was 
homogenized using a Dounce homogenizer and passed through a column homog- 
enizer (Thermo Fisher Scientific) to reduce viscosity. RNA integrity (RNA integrity 
score > 9) was measured on an Agilent Bioanalyzer (RNA nano kit). cDNA was 
synthesized using SuperScript III Reverse Transcriptase (Sigma). qPCR was per- 
formed on an Eppendorf Mastercycler ep realplex. All signals were quantified using 
the AC, method and were normalized to the levels of Gapdh. For mCherry-posi- 
tive flow cytometer sorted tumour and lung metastatic cells, cDNA was produced 
directly from lysed cells using a TaqMan Gene Expression Cells-to-Ct Kit (Thermo 
Fisher Scientific). qPCR was performed on an CFX96 (Bio-Rad) using TaqMan 
primer/probe sets and all signals were quantified as described above. 
Quantitative RT-PCR primers. Mouse Asns (exon 1-2): 5/-CCT CTG CTC CAC 
CTT CTC T-3/; 5’-GAT CTT CAT CGC ACT CAG ACA-3’, 

Mouse Asns (exon 6-7): 5/-CCA AGT TCA GTATCC TCT CCA G-3/; 5/-CTT 
CAT GAT GCT CGCTTC CA-3’. 

Mouse Tgfb1 (exon 1-2): 5’-CCG AAT GTC TGA CGT ATT GAA GA-3’; 
5'-GCG GAC TAC TAT GCT AAA GAG G-3’, 

Mouse Tgfb1 (exon 3-4): 5‘-GTT ATC TTT GCT GTC ACA AGA GC-3’; 
5!-CCC ACT GAT ACG CCT GAG-3’. 

Mouse Gapdh (exon 2-3): 5‘-AAT GGT GAA GGT CGGTGT G-3’; 5’-GTG 
GAGTCA TACTGG AAC ATG TAG-3’. 

Human ASNS (exon 8-9): 5‘-GAGTCA GAC CTT TGT TTA AAG CA-3’; 
5/-GGA GTG CTT CAATGT AAC AAG AC-3’. 

Human ASNS (exon 12-13): 5’-CTG GAT GAA GTC ATATTT TCC TTG G-3’; 
5/-CAG AGA AGATCA CCA CGCTAT C-3’. 

Human GAPDH (exon 2-3): 5‘-ACA TCG CTC AGA CAC CAT G-3'; 5/-TGT 
AGT TGA GGT CAA TGA AGG G-3/ 

TaqMan probes. Mouse Asns: Mm00803785_m1. Mouse E-cadherin (Cdh1): 
Mm01247357_m1. Mouse Twistl: Mm00442036_m1. Mouse Gapdh: 
Mm99999915_g1. 
qPCR for CTCs. CTCs were quantified as previously described!. Genomic DNA 
was isolated from blood and CTC abundance was quantified using a qPCR assay 
against mCherry, which was expressed from the retroviral shRNA delivery vectors. 
mCherry probes and primers were as follows: primer 1, 5‘-GACTACTTGAAGC 
TGTCCTTCC-3’; primer 2, 5’-CGCAGCTTCACCTTGTAGAT-3’. 

HEX probe: 5/-/56-FAM/TTCAAGTGG/ZEN/GAGCGCGTGATG 
AA/3IABkFQ//-3', 

Housekeeping probes and primers: primer 1, 5‘-GACTTGTAACGGGCA 
GGCAGATTGTG-3’; primer 2, 5/-GAGGTGTGGGTCACCTCGACATC-3’. 

HEX probe: 5/-/S5HEX/CCGTGTCGC/ZEN/TCTGAAGGGCAATAT/ 
3IABKFQ/-3’. 

Quantification of lung metastasis. For each lung, 5|1m sections were prepared and 
stained with a standard H&E protocol. Lung metastatic burden was determined by 
counting individual lung nodules on one section. 

E-cadherin and Twist1 analysis. For immunohistochemistry, primary tumours 
and lungs were processed as previously described'. E-Cadherin (24E10) Rabbit 
mAb (3195, Cell Signaling) was used in a 1:400 dilution and Twist1 (Twist2Cla) 
Mouse mAb (ab50887, Abcam) was used in a 1:100 dilution. E-cadherin and Twist] 
diaminobenzidine-stained (DAB) and haematoxylin stainings were quantified 
using Image] (NIH). For this, images were colour deconvoluted according to ref. 21 
and the percentage area of E-cadherin- and Twist] -positive staining was measured. 
Free amino-acid quantification using HPLC. Free amino acids were quantified 
in cultured cells and blood serum. For Extended Data Fig. 2e, 4T1-T cells were 
cultured under normal medium conditions. For Extended Data Fig. 4a, c, 4T1 and 
MDA-MB-231 cells were cultured in medium containing 100 x concentration of 
the specified amino acid (relative to the concentration achieved by supplementing 
the medium with 1x NEAA) for 2 or 3 days, respectively. All cultured cells were 
washed in ice-cold PBS, homogenized using a Dounce homogenizer, and the lysate 
subsequently filtered. Each biological sample was quantified in triplicates using 


HPLC and a fluorometric detector. For each replicate, nanomoles of each amino 
acid were measured. 

Proteomic profiling using isobaric tags for relative and absolute protein 
quantification. All analyses were performed with three biological replicates per 
treatment. Cell pellets were lysed with 0.1 M triethylammonium bicarbonate, 
0.1% SDS buffer, followed by tip sonication and boiling at 90°C for 5 min. Protein 
concentration was estimated using a Bradford assay (Bio-Rad, Quick Start), and 
90 xg of total protein per sample were reduced with tris-2-carboxymethyl phos- 
phine (Sigma) for 1h at 60°C at a final concentration of 5mM. Cysteines were 
blocked for 10 min at room temperature using methyl methanethiosulfonate 
(Sigma) at a final concentration of 10mM. Samples were digested overnight at 
37°C with trypsin (Pierce) and peptides were labelled with the TMT 10plex rea- 
gents according to the manufacturer's instructions (Thermo Fisher Scientific). The 
TMT mixture was fractionated on a Dionex Ultimate 3000 system at high pH using 
a XBridge C18 column (3.5j1m, 2.1 x 150mm, Waters). Fractions were analysed on 
a Dionex Ultimate 3000 UHPLC system coupled with a nano Q-Exactive (Thermo 
Fisher Scientific) mass spectrometer. Samples were loaded on an Acclaim PepMap 
100, 100jum x 2cm C18, 5,m, 100A trapping column with the ulPickUp injection 
method at a loading flow rate of 4,11 min! for 10 min. For peptide separation, an 
EASY-Spray analytical column 75 um x 25cm, C18, 2j1m, 100 A column was used 
for multi-step gradient elution. Mobile phase (A) was composed of 2% acetonitrile, 
0.1% formic acid, 5% dimethyl sulfoxide (DMSO); mobile phase (B) was composed 
of 80% acetonitrile, 0.1% formic acid, 5% DMSO. The full scans were obtained at 
70,000 resolution and the MS2 scans at 35,000 resolution, with collision energy 33% 
and isolation window 1.2 Th. Raw data were processed with the SequestHT search 
engine on Proteome Discoverer 1.4 software. The parameters for the SequestHT 
node were as follows: precursor mass tolerance 20 p.p.m., fragment mass tolerance 
0.02 Da; dynamic modifications were oxidation of M (+15.995 Da), deamidation 
of N, Q (+0.984 Da); and static modifications were TMT6plex at any N terminus, 
K (4+229.163 Da) and Methylthio at C (+45.988 Da). 

For each sample, raw peptide intensities were normalized by their sum. Peptides 
that had a representation that fell within the bottom quartile based on summed 
intensities across samples were not analysed further. For the dataset represented 
in Fig. 4a and Extended Data Fig. 7a, b, proteins were removed from subsequent 
analysis if fewer than 5 corresponding proteins were identified in any sample. 
Protein-level differences between treatments were calculated based on the trimmed 
mean of corresponding peptide log-fold changes (trimming the top and bottom 
25% for the experiment represented in Fig. 4a and Extended Data Fig. 7a, b, and 
the top and bottom 10% for the experiment represented in Fig. 4b). Relative pro- 
tein abundances were calculated based on the trimmed mean of corresponding 
log-transformed peptide intensities (trimming the top and bottom 25% and 10% 
for the experiments representing in Fig. 4a and b, respectively). 

Metabolite profiling using LC-MS/MS. Organ tissue samples were placed in 
2 ml lysing tubes prefilled with 1.4-mm ceramic beads for mammary glands or 
2.8-mm ceramic beads for lungs and 1 ml of pre-chilled (—80°C) 80% methanol. 
Samples were homogenized with a Precellys24 homogenizer (Bertin Instruments) 
programmed with three 30-s cycles at 6,500 Hz and 4-min pause times. At the end 
of each cycle, samples were snap-frozen in liquid nitrogen and placed on dry ice. 
Metabolite extraction of blood serum samples (5011) was performed using 200 1l 
of 80% methanol at —80°C. After centrifugation for 10 min (13,200 revolutions 
per minute, 4°C), supernatants were evaporated to dryness and stored at —80°C 
until liquid chromatography-tandem mass spectrometry (LC-MS/MS) analysis. 

Dried-down extracts were re-suspended in 2511 HPLC-grade water, and 1 il 
was analysed using hydrophilic interaction chromatography coupled to tandem 
mass spectrometry analysis operated in the selected reaction monitoring mode 
(LC-SRM-MsS). Analytical instrumentation consisted of a Nexera X2 (Shimadzu) 
liquid chromatography system with a XBridge BEH amide 2.1 x 100mm, 2.5-1m 
column (Waters) and a QTRAP 6500 hybrid triple quadrupole/linear ion trap mass 
spectrometer (SCIEX) equipped with an electrospray ion source. Raw LC-SRM-MS 
data were acquired with Analyst 1.6.2 (SCIEX) and peak areas of LC-SRM-MS 
traces for each metabolite were integrated using MultiQuant 1.1 software (SCIEX). 
Amino acid compositional analysis. For Extended Data Fig. 7b, the amino acid 
representations of genes that showed the greatest decrease in their protein or RNA 
levels (bottom 25% log-fold change) were compared to those of the analysed gene 
set, using a rank-sum test. For the amino acid enrichment analysis of protein minus 
RNA level expression changes, transcriptional- and protein-level log-fold changes 
were quantile normalized to the same distribution. RNA-level changes were then sub- 
tracted from protein-level changes. The amino acid representations of genes whose 
subsequent values fell within the lowest 25% were then compared to the analysed 
gene set using a rank-sum test, to identify amino acids whose abundance correlated 
with the protein-level changes that were not explained by transcriptional changes. 

For Extended Data Fig. 7c, the same analysis was performed, this time com- 
paring the proteins of genes that had been detected as upregulated during EMT 
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compared with all other genes. For the mouse amino-acid enrichment, the EMT-up 
genes were mouse orthologues of the EMT-up human genes. 

For Extended Data Fig. 8, each organism harbouring a minimum of ten genes 
that were orthologues of the pro-EMT human genes described in Extended Data 
Fig. 7c were analysed. For each organism, the asparagine percentage of each 
protein was calculated. Then the asparagine enrichment level for each organism 
was determined by calculating the ratio of the median asparagine percentage of 
pro-EMT proteins versus the remaining organism-specific proteins. The statistical 
significance of enrichment was calculated as described for Extended Data Fig. 7c. 
Ribosome profiling analysis. For analysis, quality trimming and sequencing 
adapter removal was performed using Cutadapt””. Bowtie2 was used to remove 
reads that mapped to contaminating RNAs (for example, rRNA and tRNA 
sequences)'®. STAR was subsequently used to map reads of length 29-33 to the 
human transcriptome”*. The offset was corrected for each read on the basis of read 
length, and 12 and 15 nucleotides downstream were marked as P and A sites. For 
each gene, we then calculated the number of events at all positions and aggregated 
the counts for each codon and subsequently amino acid. Genes with fewer than ten 
events in any sample were not considered in subsequent analyses. 

Code availability. All custom code used during the current study is available from 
the corresponding author upon reasonable request. 

Data availability. All raw and processed high-throughput sequencing data are 
available at the Gene Expression Omnibus under accession numbers GSE104968 
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and GSE107109. The mass spectrometry proteomics data have been deposited 
in the ProteomeXchange consortium via the PRIDE” partner repository, under 
the dataset identifier PXD008805. Source Data are available for tumour volume 
measurements, lung metastases and proteomic measurements. All other datasets 
generated during and/or analysed during the current study are available from the 
corresponding author upon reasonable request. 
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Extended Data Figure 1 | See next page for caption. 
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Extended Data Figure 1 | Analysis of ASNS expression levels in patient 
data. a, Expression level of genes identified as overexpressed in 4T1-T 
compared with 4T1-E in the primary tumours of patients with different 
disease subtypes (edges of the box are the 25th and 75th percentiles and 
error bars extend to the values q3 + w(q3 — qi) and q; — w(q3 — qi), in 
which w is 1.5 and q; and q; are the 25th and 75th percentiles, which is 
also true for b, d, and e, ANOVA P< 0.0001). b, Expression level of the 
same genes in disease-free survivors and patients with relapse to the lung 
(rank-sum P < 0.01). ¢, For each gene that was identified in the screen, 

a prognostic value was calculated using three different datasets. One 
consisted of gene expression measurements in three patient-matched 
basal tumour and metastasis pairs (patients Al, A7, and A11). Here 

genes were classified as correlated with progression if expression was 
higher in each of the metastases and negatively correlated if expression 
was higher in each of the primaries. The other two datasets consisted of 
primary tumour gene-expression profiles with matched outcomes. For the 
UNC254 patient dataset, the site of relapse was not available and genes 
were deemed positively correlated with progression if they had significant 
(Cox P < 0.05) relapse-free survival hazard ratios greater than 1, and 
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negatively correlated if these ratios were significant (Cox P < 0.05) and less 
than 1. As the UNC855 dataset also had site of relapse information, here 
both relapse-free and lung relapse-free survival (RFS and LRFS) hazard 
ratios were used to classify genes as positively or negatively correlated with 
progression based on the same criteria that were used for the UNC254 
data. d, Expression level of ASNS in the primary tumours of patients 

with different disease subtypes (ANOVA P< 0.0001). e, Expression level 
of ASNS in the primary tumours of patients with non-specific relapse 

and relapse to the lymph node, bone, brain, liver, or lung compared with 
expression levels in patients without relapse to each corresponding site 
(rank-sum P < 0.005). f, Analysis of ASNS in three additional sets from 
patients with breast cancer (MDACC, METRABIC, and TCGA). Shown 
are survival plots and relevant statistics (Cox P< 0.01). g, Analysis of 
ASNS in the TCGA Pan-Cancer expression data. Shown are survival plots 
and relevant statistics for the ten non-breast solid tumours represented in 
the dataset (Cox P < 0.05 for colon, squamous head and neck, renal clear 
cell, and endometrial cancers). h, Analysis of ASNS across all tumours 
represented in the TCGA Pan-Cancer dataset (Cox P=1.5 x 1077), 
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Extended Data Figure 2 | See next page for caption. 
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Extended Data Figure 2 | Primary validation of Asns as a driver of 
invasion and metastasis. a, Representative images of the lungs of mice 
that were intravenously injected with Asns-silenced or -expressing 4T1-T 
cells as described in Fig. 2a. b, Quantification of Matrigel invasion capacity 
for Asns-silenced and -expressing 4T1-T cells (n = 3 replicates per cell 
line). c, Quantification of mCherry-positive 4T1-T cells after roughly 50% 
of cells were infected with mCherry-expressing constructs harbouring 
shRNAs targeting Renilla luciferase and Asns. Cells were grown during the 
24-h period that the Matrigel invasion assay described in Fig. 2b was being 
performed (n= 3 replicates per cell line). d, Violet cell-labelling intensity 
of Asns-silenced and -expressing 4T1-T cells, relative to the initial 
population. Cells were grown during the 24-h period that the Matrigel 
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invasion assay described in Fig. 2b was being performed (n = 3 replicates 
per cell line). e, Free amino-acid quantification by HPLC for each 

amino acid in Asns-expressing and -silenced cells. Shown are the 
log-fold changes for each amino acid (n = 3 replicates per cell line). 

f, Quantification of mCherry-positive 4T1-T cells after roughly 50% 

of cells were infected with mCherry-expressing constructs harbouring 
shRNAs targeting Renilla luciferase and Asns. After infection, cells were 
grown in medium supplemented with L-asparagine or D-asparagine and 
mCherry percentages were measured at 48 and 96h (n= 3 replicates per 
cell line). g, Quantification of Matrigel invasion for Asns-silenced and 
-expressing cells when assayed in medium supplemented with and without 
L-asparagine (n= 3 invasion chambers per cell line). 
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Extended Data Figure 3 | Secondary validation of Asns as a driver of 
invasion and metastasis. a, Volume measurements of tumours resulting 
from orthotopic injection of Asns-silenced and -expressing parental 

4T1 cells (n= 10 mice per cell line, edges of the box are the 25th and 
75th percentiles and error bars extend to the values q3 + w(q3 — qi) and 
qi — w(q3 — qi), in which w is 1.5 and q, and q3 are the 25th and 75th 
percentiles, which is also the case for b-g). b, Quantification of lung 
metastases corresponding to the tumours described in a (rank-sum 
P<0.002). c, Volume measurements of tumours resulting from orthotopic 
injection of parental 4T1 cells with basal (Empty) or enforced expression 
of Asns (n= 10 mice per cell line). d, Quantification of lung metastases 


corresponding to the tumours described in c (rank-sum P< 5.0 x 107°). 

e, Average diameters of the metastases of each mouse described in d 
(rank-sum P< 0.001). f, Volume measurements for tumours resulting 
from orthotopic injection of MDA-MB-231 cells with basal (Empty) or 
enforced expression of ASNS (n = 10 mice per cell line). g, Quantification 
of lung metastases corresponding to the tumours described in f (rank-sum 
P<0.005). h, Quantification of Matrigel invasion for the MDA-MB-231- 
derived cell lines described in f (n =3 invasion chambers per cell line). 

i, Representative images of the collection wells for the invasion assays 
described in h. See Source Data. 
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Extended Data Figure 4 | Primary validation that extracellular 
asparagine availability affects invasion and metastasis. a, HPLC 
quantification of cellular free amino-acid percentages for parental 

4T1 cells when the medium is supplemented with each of the NEAAs 
lacking in the DMEM culture medium (n = 3 replicates per cell line). 

b, Quantification of MDA-MB-231 Matrigel invasion rates under the same 
conditions as described in Fig. 3a (n =5 invasion chambers per condition, 
rank-sum P< 0.001). ¢, HPLC quantification of cellular free amino- 

acid percentages for MDA-MB-231 cells when cultured in the medium 
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conditions described in a (n = 3 replicates per cell line). d, Violet cell- 
labelling intensity of parental 4T1 cells when grown in asparagine-lacking 
or -supplemented medium for the same period that the Matrigel invasion 
assay described in Fig. 3a was being performed (n= 3 replicates per cell 
line). e, Violet cell-labelling intensity of MDA-MB-231 cells when grown 
in asparagine-lacking or -supplemented medium for the same period that 
the Matrigel invasion assay described in b was being performed (n= 3 
replicates per cell line). 
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Extended Data Figure 5 | See next page for caption. 
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Extended Data Figure 5 | Secondary validation that extracellular 
asparagine availability affects invasion and metastasis. a, Tumour 
volumes resulting from the orthotopic injection of parental 4T1 cells. 
Half of the mice received L-asparaginase and the other half received an 
equivalent volume of PBS at the same injection rate (n = 10 mice per 
condition, edges of the box are the 25th and 75th percentiles and error 
bars extend to the values q3 + w(q3 — qi) and q; — w(q3 — qi), in which 

w is 1.5 and q, and q; are the 25th and 75th percentiles, which is also the 
case for b-g). b, Quantification of lung metastases detected in the animals 
described in a (rank-sum P< 0.001). c, Representative H&E-stained 

lung sections as described in b. d, Quantification of the lung metastases 
described in Fig. 3b, in which Asns-silenced and -expressing 4T1-T cells 
were injected into mice. Half of the mice received L-asparaginase and the 
other half received an equivalent volume of PBS at the same injection rate 


LETTER 


(n= 10 mice per condition, rank-sum P < 0.0005 for L-asparaginase versus 
control for each line and for Asns-silenced versus -unsilenced cells in each 
drug condition). e, Tumour volumes corresponding to the lung metastases 
described in d (rank-sum P < 0.005 for Asns-silenced versus -expressing 
cells in L-asparaginase-treated mice). f, Lung metastases resulting from the 
orthotopic injection of ASNS-silenced and -expressing MDA-MB-231 cells 
and subsequent treatment of the injected animals with L-asparaginase or 
PBS (n= 10 mice per cell line, rank-sum P < 0.05 for ASNS-silenced versus 
-expressing cells in both conditions and for silenced cells in treated versus 
untreated mice). g, Tumour volumes corresponding to the mice described 
in f (rank-sum P < 0.05 for Asns-silenced versus -expressing cells under 
both treatments and for PBS versus L-asparaginase-treated animals for 
each cell line). See Source Data. 
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Extended Data Figure 6 | Tertiary validation that extracellular resulting from the orthotopic injection of parental 4T1 cells into mice 
asparagine availability affects invasion and metastasis. a, Asparagine fed with 0%, 0.6%, or 4% asparagine diets (n = 10 mice per diet). 
content in the serum free amino-acid pool, for mice fed 0%, 0.6%, or 4% e, Quantification of metastases in the lungs of the animals described in 
asparagine diets (n =5 mice per diet, edges of the box are the 25th and d (rank-sum P < 0.05 for mice receiving 0% versus 0.6% and 0% versus 
75th percentiles and error bars extend to the values q3 + w(q3 — qi) and 4% diets). f, Representative images of H&E-stained sections of the lungs 
qi — w(q3 — qi); in which w is 1.5 and q; and q3 are the 25th and 75th described in e. g, Relative expression of Asns in the mammary gland, 
percentiles, which is also the case for b, d, e and h, rank-sum P< 0.05 serum, and lungs of mice treated with L-asparaginase or PBS, as 
between each diet). b, Volumes of orthotropic tumours corresponding measured by qPCR with two primer pairs P1 and P2 (n =3 per 
to the lung metastases described in Fig. 3c, in which Asns-silenced condition). h, Transcripts per million (TPM) expression measurements 
and -expressing 4T1-T cells were orthotopic injected into mice fed for ASNS in human breast, lung, and whole-blood samples (n > 114 for 
with 0%, 0.6%, and 4% asparagine diets (n = 10 mice per condition). each tissue, rank-sum P < 2.8 x 107%” for blood versus breast and blood 
c, Representative images of the lung metastases described for Fig. 3c, versus lung). See Source Data. 


which also correspond to the mice described in b. d, Volumes of tumours 
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Extended Data Figure 7 | Primary validation that asparagine 
availability regulates EMT. a, Protein-level changes between Asns- 
silenced and -expressing cells when genes are stratified by transcription- 
level changes (top and bottom 10% of genes based on log-fold change 
in Asns-silenced cells, gene-up and -down, respectively) and asparagine 
content (top and bottom 10% of genes based on asparagine content, 
Asp-high and -low, respectively), edges of the box are the 25th and 75th 
percentiles and error bars extend to the values q3 + w(q3 — qi) and 

qi — w(q3 — qi), in which w is 1.5 and q, and q; are the 25th and 75th 
percentiles, which is also the case for d and e, rank-sum P< 5.0 x 10-4 
for both individual variables, and rank-sum P < 0.005 for interacting 
variables). b, Amino acid enrichment analysis of downregulated genes 
(bottom 25% based on log-fold change) on the basis of RNA and 
protein levels in Asns-expressing versus -silenced 4T1-T cells. Negative 
correlations indicate the amino acid is depleted in the downregulated 
genes, whereas positive correlations indicate the amino acid is enriched. 
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For protein minus RNA level expression changes, amino acids with 
positive correlations are enriched in proteins in which depletion levels 
exceed what is predicted by corresponding RNA changes. Negative 
correlations indicate the amino acid is enriched in proteins in which 
depletion levels are less than what is predicted by corresponding RNA 
changes (rank-sum P< 1.0 x 107° for asparagine in protein and protein- 
RNA). c, Amino-acid enrichment in mouse and human EMT-up proteins 
(rank-sum P < 0.01 for both human and mouse). d, Position 15 asparagine 
codon enrichment in ribosome protected fragments from PC-3 cells 
grown with and without L-asparaginase, when all genes or only EMT-up 
genes are analysed (outliers were not plotted to improve interpretability, 
which is also the case for e, rank-sum P < 0.05 for EMT-up versus all 
genes in both untreated and L-asparaginase-treated cells). e, Increase in 
asparagine codon representation in ribosome protected fragments, when 
PC-3 cells are grown in L-asparaginase (relative to without), and all genes 
or EMT-up genes are analysed (rank-sum P < 0.05). See Source Data. 
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Extended Data Figure 8 | Conservation of asparagine enrichment in Matrix database that harbour at least 10 orthologues (sign-rank 
EMT promoting proteins. Asparagine enrichment analysis of EMT- P<1.0 x 107} for all species and rank-sum P< 9.0 x 10~° for mammals 
promoting protein orthologues in the 126 species listed in the Orthologous _ versus other species). 
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Extended Data Figure 9 | See next page for caption. 
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Extended Data Figure 9 | Secondary validation that asparagine 
availability regulates EMT. a, Transcription-level changes in EMT-up and 
-down genes that occur in response to Asns silencing in 4T1-T cells (n =2 
replicates per condition, edges of the box are the 25th and 75th percentiles 
and error bars extend to the values q3 + w(q3 — qi) and q; — w(q3 — qi), in 
which w is 1.5 and q; and q3 are the 25th and 75th percentiles, which is also 
the case for b-e, h and i, rank-sum P < 0.001 for EMT-up genes, DESeq 
false discovery rate < 0.05 for Twistl and Cdh1). b, Transcription-level 
changes in EMT-up and -down genes that occur in response to the medium 
of Asns-silenced 4T1-T cells being supplemented with L-asparagine (n =2 
replicates per condition, rank-sum P< 0.005 for EMT-up genes). c, Gene 
expression changes in EMT-up and -down genes that result from ATF4 
knockout in near haploid KBM-7 chronic myelogenous leukaemia (HAP-1 
cells, rank-sum P < 0.05 for EMT-up genes). d, Gene expression changes 
in EMT-up and -down genes, which result in the liver cells of homozygous 


ATF4-deleted mice when treated with L-asparaginase (rank-sum P < 0.05 
for EMT-down genes in wild-type mice treated with L-asparaginase 
(WT+L-asp) mice, and both EMT-up and -down genes in ATF4 mice 
treated with L-asparaginase (ATF4+L-asp) mice). e, Volumes of tumours 
resulting from orthotopic injection of Tgf-3-silenced and -expressing 
4T1-T cells (n= 10 mice per cell line). f, Percentage of Twist1-positive 
regions based on IHC staining of sections from tumours described in e 
(n=5 tumour sections per cell line, rank-sum P< 0.01). g, Percentage of 
Cdh1-positive regions based on IHC staining of sections from tumours 
described in e (n=5 tumour sections per cell line, rank-sum P< 0.01). 

h, Quantification of metastases resulting from the tumours described 

in e (rank-sum P< 0.05). i, Quantification of metastases resulting from 
intravenous injection of Tgf-6-silenced and -expressing cells (1 = 10 mice 
per cell line, rank-sum P< 0.05). See Source Data. 
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Extended Data Figure 10 | Tertiary validation that asparagine c, Relative Twist1 expression, as measured by qPCR, which were sorted 
availability regulates EMT. a, Representative H&E-stained sections of from the tumours and lungs of mice injected orthotopically with Asns- 
the tumours described in Fig. 2c, in which Asns-silenced and -expressing silenced and -expressing 4T1-T cells (1 =3 tumours and lungs per cell 
4T1-T cells were orthotopically injected into NSG mice. b, Images of line). d, Relative Cdh1 expression, as measured by qPCR, in the tumours 
cultured cells after they were isolated from the tumours and lungs of mice and lungs described in ¢ (n =3 tumours and lungs per cell line). 


injected orthotopically with Asns-silenced and -expressing 4T1-T cells. 
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Extended Data Figure 11 | Quaternary validation that asparagine 
availability regulates EMT. a, Representative images of IHC staining 

for Twist] and Cdh1 on sections from lungs described in Fig. 4e, in which 
mice were injected orthotopically with Asns-silenced and -expressing 
4T1-T cells. b, Quantification of all Twist] stainings, described in 

Fig. 4e and a (n=5 tumour sections and n > 5 lung metastases, edges of 
the box are the 25th and 75th percentiles and error bars extend to the 
values q3 + w(q3 — qi) and qi — w(q3 — qi), in which w is 1.5 and q, and q3 
are the 25th and 75th percentiles, which is also the case for c-g, rank-sum 
P<0.01 and P< 0.05 for Asns-silenced versus -expressing tumours and 
metastases, respectively). c, Quantification of all Cdh1 stainings, described 
in b (n=5 tumour sections and n = 9 lung metastases, rank-sum P< 0.01 
and P< 0.05 for Asns-silenced versus -expressing tumours and metastases, 
respectively). d, Quantification of Twist1-positive regions in the tumours 
resulting from orthotopic injection of Asns-expressing and -silenced 


0.0% 0.6% 4.0% 


4T1-T cells into animals treated with PBS or L-asparaginase (n = 5 tumour 
sections per condition, rank-sum P < 0.01 for Asns-silenced versus 
-unsilenced cells and rank-sum P< 0.05 for each cell line in treated versus 
untreated mice). e, Quantification of Cdh1-positive regions in the tumours 
described in d (n=5 tumour sections per condition, rank-sum P< 0.01 
for Asns-silenced versus -unsilenced cells and rank-sum P < 0.05 for each 
cell line in treated versus untreated mice). f, Quantification of Twist1- 
positive regions in tumours resulting from orthotopic injection of Asns- 
expressing and -silenced cells into mice fed a 0%, 0.6%, or 4% asparagine 
diet (n = 5 tumour sections per condition, rank-sum P < 0.01 between 
Asns-silenced and -expressing cells and between diets). g, Quantification 
of Cdh1-positive regions in the tumours described in f (n=5 tumour 
sections per condition, rank-sum P < 0.01 between Asns-silenced and 
-expressing cells and between diets). 
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Mitochondria-—lysosome contacts regulate 
mitochondrial fission via RAB7 GTP hydrolysis 


Yvette C. Wong’, Daniel Ysselstein! & Dimitri Krainc! 


Both mitochondria and lysosomes are essential for maintaining 
cellular homeostasis, and dysfunction of both organelles has 
been observed in multiple diseases!~*. Mitochondria are highly 
dynamic and undergo fission and fusion to maintain a functional 
mitochondrial network, which drives cellular metabolism*. 
Lysosomes similarly undergo constant dynamic regulation by 
the RAB7 GTPase, which cycles from an active GTP-bound state 
into an inactive GDP-bound state upon GTP hydrolysis. Here we 
have identified the formation and regulation of mitochondria- 
lysosome membrane contact sites using electron microscopy, 
structured illumination microscopy and high spatial and temporal 
resolution confocal live cell imaging. Mitochondria-lysosome 
contacts formed dynamically in healthy untreated cells and were 
distinct from damaged mitochondria that were targeted into 
lysosomes for degradation’. Contact formation was promoted by 
active GTP-bound lysosomal RAB7, and contact untethering was 
mediated by recruitment of the RAB7 GTPase-activating protein 
TBC1D15 to mitochondria by FIS1 to drive RAB7 GTP hydrolysis 
and thereby release contacts. Functionally, lysosomal contacts mark 
sites of mitochondrial fission, allowing regulation of mitochondrial 
networks by lysosomes, whereas conversely, mitochondrial contacts 
regulate lysosomal RAB7 hydrolysis via TBC1D15. Mitochondria- 
lysosome contacts thus allow bidirectional regulation of 
mitochondrial and lysosomal dynamics, and may explain the 
dysfunction observed in both organelles in various human diseases. 

Mitochondrial fission has multiple roles including mitochondrial 
biogenesis and mitochondrial DNA synthesis*®, and is regulated by the 
GTPase dynamin-related protein (DRP1), the endoplasmic reticulum, 
dynamin-2 and actin®"'°. By contrast, lysosomal dynamics are regulated 
by GTP-bound active RAB7, which is recruited to late endosomal- 
lysosomal membranes but dissociates upon RAB GAP (GTPase- 
activating protein)-mediated GTP hydrolysis to become inactive, 
GDP-bound, and cytosolic’'”. Contact sites between mitochondria 
and lysosomes could thus provide a potential cellular mechanism for 
simultaneously regulating these dynamics. 

Contacts between mitochondria and melanosomes, multi-vesicular 
bodies and yeast vacuoles have previously been studied”!*~°. Here, 
we identified contact sites between mitochondria and lysosomes in 
mammalian cells by performing electron microscopy on untreated 
HeLa cells. Mitochondria and lysosomes formed contacts (Fig. la 
and Extended Data Fig. la—c, yellow arrows) with an average distance 
between membranes of 9.57 +0.76nm, consistent with other contact 
sites*!?, and contact length of 198.33 + 16.73 nm (n=55 contacts from 
20 cells; Fig. 1b). Using correlative light electron microscopy (CLEM), 
we confirmed that lysosomes or late endosomes positive for the acidic 
organelle label LysoTracker Red contained ultrastructure electron- 
dense lumens with irregular content and/or multilamellar membrane 
sheets (Extended Data Fig. 1d) and could simultaneously contact mito- 
chondria and the endoplasmic reticulum (Extended Data Fig. le). 
Three-dimensional super-resolution structured illumination micro- 
scopy (N-SIM) of endogenous LAMP1 on late endosomal-lysosomal 


membranes, and TOM20 on outer mitochondrial membranes, further 
demonstrated that mitochondria—lysosome contacts spanned more 
than 200 nm in the Z-plane (n =210 examples from 26 cells; Fig. 1c 
(left) and Extended Data Fig. 1f). 

We next examined mitochondria-lysosome contacts in live cells 
using super-resolution N-SIM, and found that vesicles positive for 
LAMP1 labelled with mGFP (LAMP1-mGFP) and mitochondria 
expressing TOM20 labelled with mApple (mApple-TOM20) formed 
contacts in living HeLa cells (Fig. 1c, right). Using confocal microscopy 
at high spatial and temporal resolutions, mitochondria were found to 
contact both small (vesicle diameter <0.5 1m) and larger (vesicle dia- 
meter >1,1m) LAMP!1 vesicles (Extended Data Fig. 2a, b), and LAMP1 
vesicles could simultaneously contact multiple mitochondria (Extended 
Data Fig. 2c) and vice versa (Extended Data Fig. 2d). We also observed 
multiple examples of mitochondria—lysosome contacts stained for 
endogenous LAMP] and TOM20 under confocal microscopy (n= 341 
examples from 25 cells; Extended Data Fig. 2e). 

LAMP! vesicles and mitochondria remained in stable contact 
over time (Fig. 1d-g, yellow arrows; Supplementary Video 1), with 
LAMP!1 vesicles approaching mitochondria to form stable contacts 
(Fig. 1h, yellow arrows), but eventually leaving mitochondria (white 
arrow) without engulfing them (Extended Data Fig. 2f, g). Contacts 
observed by confocal microscopy and live cell N-SIM lasted for 10s 
or more (Fig. li and Extended Data Fig. 3a-c), with about 15% of 
LAMP! vesicles in the cell contacting mitochondria at any given 
time (Fig. 1j). Furthermore, sensitized emission fluorescence reso- 
nance energy transfer (SE-FRET) was observed between TOM20- 
Venus (outer mitochondrial membrane) and LAMP1-mTurquoise2 
(lysosomal membrane) at mitochondria—lysosome contacts 
(Extended Data Fig. 3d, e), further confirming the formation of these 
contacts in living cells. 

Next, we analysed whether mitochondria-lysosome contacts repre- 
sent sites of bulk protein transfer or mitochondrial degradation, either 
directly through mitochondrial-derived vesicles (MDVs) fusing with 
lysosomes’ or indirectly through mitophagy®. Intermembrane space 
mitochondrial proteins and mitochondrial matrix proteins (Fig. 1k and 
Extended Data Fig. 4a-f) were not bulk transferred into lysosomes, and 
conversely, lysosomal luminal content marked by dextran was not bulk 
transferred into mitochondria at contact sites (Fig. 1k and Extended 
Data Fig. 4g-i). Moreover, mitochondria in contact with lysosomes 
were substantially larger (over 500 nm) than MDVs (about 100nm)? 
and contained mitochondrial matrix proteins (Fig. 1k and Extended 
Data Fig. 4d-f), distinct from previously described TOM20-positive 
MDVs"?, Mitochondria contacting lysosomes also did not undergo 
mitophagy, as they were not engulfed by LC3-positive autophagosomes 
(Extended Data Fig. 4j) or positive for autophagosome biogenesis 
markers (Extended Data Fig. 4k), suggesting that mitochondria- 
lysosome contacts do not lead to the bulk transfer of organelle luminal 
content or bulk mitochondrial degradation. 

We then investigated whether mitochondria—lysosome contacts 
might be modulated by the lysosomal regulator RAB7 GTPase’. In 
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Figure 1 | Mitochondria and lysosomes form stable membrane 
contact sites. a, b, Representative electron microscopy image of 
mitochondria (M) and lysosome (L) contact (yellow arrows) in 
untreated HeLa cells (a) and quantification of distance between contact 
membranes and length of contact (b, n =55 examples from 20 cells). 

c, Representative structured illumination microscopy (N-SIM) images 
of mitochondria—lysosome contacts (yellow arrows) in fixed HeLa 

cells stained for endogenous LAMP!1 (lysosome) and TOM20 
(mitochondria) and imaged in Z-stacks showing contacts extending 
more than 200 nm in the Z-plane (3D N-SIM; left; 7 = 210 examples 
from 26 cells) and in living HeLa cells expressing LAMP1-mGFP and 
mApple-TOM20 (live N-SIM; right; n = 43 examples from 10 cells). 
d-h, Representative time-lapse confocal images of stable mitochondria 
lysosome contacts (yellow arrows) in living HeLa cells expressing 
LAMP1-mGFP (lysosomes) and mApple-TOM20 (mitochondria) 
(n= 67 examples from 23 cells). White arrows in h mark lysosomes 
before or after contact tethering to mitochondria. Black line shows 
duration of contact. i, j, Quantification of duration of mitochondria-— 
lysosome contacts (i) and percentage of lysosomes contacting 
mitochondria (for >10; j) from confocal time-lapse images (n = 45 
examples from 10 cells). k, Quantification of percentage of mitochondria 
(TOM20) or lysosomes (LAMP1) positive for mitochondrial matrix 
protein (mito—BFP; n = 104 events from 23 cells), mitochondrial 


intermembrane space (IMS) protein (SMAC-EGFP; n= 57 examples from 


12 cells), or lysosomal lumen marker (pulse-chased dextran; n = 66 events 


from 18 cells) at mitochondria-lysosome contacts in living HeLa cells. Data 


are means + s.e.m. (***P < 0.0001, unpaired two-tailed t-test). Scale bars, 
200 nm (a); 500 nm (c, 3D N-SIM); 500 nm (c, Live N-SIM; left, right); 
100 nm (c, Live N-SIM; middle); 1 {1m (d); 0.5 jm (e-h). 
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Figure 2 | RAB7 GTP hydrolysis promotes mitochondria-lysosome 
contact untethering. a—c, Representative time-lapse images of lysosome 
in cytosol (white arrow; top) approaching mitochondria to form a 

stable contact (yellow arrows; black line) before leaving mitochondria 
(white arrow; bottom) in living HeLa cells expressing mApple-TOM20 
(mitochondria) and lysosomal markers LAMP1-mGFP (a), RAB7-GFP 
(b) or constitutively active GTP-bound RAB7(Q67L)-GFP mutant unable 
to undergo GTP hydrolysis (c) (n =45 events from 9 cells per condition). 
d, Representative time-lapse images of mitochondria—lysosome contacts 
(yellow arrows) lasting more than 150s in RAB7(Q67L)-GFP cells (n = 45 
events from 9 cells). e-g, Expression of RAB7(Q67L) mutant leads to 
increased percentage of lysosomes in contacts (n = 12 cells per condition), 
and increased minimum duration of mitochondria-lysosome contacts 
(n= 45 events from 9 cells per condition). Data are means + s.e.m. 

(***P < 0.0001, unpaired two-tailed t-test). Scale bars, 1 {1m (a-c); 

0.5 41m (d). 


contrast to LAMP1-mGFP (Fig. 2a; Supplementary Video 2) or wild- 
type RAB7-GFP (Fig. 2b), expression of constitutively active GTP- 
bound mutant RAB7(Q67L)-GFP (Fig. 2c,d; Supplementary Video 3), 
which localized to lysosomal membranes, markedly increased both the 
percentage of lysosomes forming stable contacts with mitochondria 
(Fig. 2e) and mitochondria-lysosome contact duration (n = 45 events 
per condition; Fig. 2f, g). RAB7(Q67L) further resulted in a twofold 
increase in TOM20-LAMPI1 mitochondria-lysosome FRET intensity 
compared to wild-type RAB7 (n= 200 cells per condition; Extended 
Data Fig. 3f), suggesting that GTP-bound RAB7 promotes contact 
formation whereas RAB7 GTP hydrolysis may be required for mito- 
chondria—lysosome contact untethering. 
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We next examined how RAB7 GTP hydrolysis might be regulated 
at mitochondria-lysosome contacts. TBC1D15 is a RAB7 GAP that is 
recruited to mitochondria by the mitochondrial protein FIS17*”° to 
drive RAB7 GTP hydrolysis**”’, potentially allowing mitochondria to 
regulate both contact untethering and lysosomal RAB7 hydrolysis via 
TBCID15. Consistent with previous studies**°, mitochondrial local- 
ization of TBC1D15 was dependent on FIS1 binding (Extended Data 
Fig. 5a, d-f) but not inhibited by TBC1D15 mutants lacking GAP activity 
(D397A or R400K in the TBC domain)” (Extended Data Fig. 5b, c, e). 
Moreover, expression of mutant TBC1D15 could induce abnormally 
large lysosomes (Extended Data Fig. 5g), characteristic of inhibiting 
RAB7 GTP hydrolysis. 

Using live cell time-lapse imaging, we found that the GAP 
mutants TBC1D15(D397A) (Fig. 3a, b and Extended Data Fig. 6a, b; 
Supplementary Video 4) and TBC1D15(R400K) (Fig. 3c) markedly 
increased mitochondria—lysosome contact duration compared to wild- 
type TBC1D15 (n= 34-38 events per condition; Fig. 3d, e) but did not 
alter the percentage of lysosomes forming contacts with mitochondria 
(Extended Data Fig. 6c). TBCID15~/~ HCT116 cells, generated using 
transcription activator-like effector nucleases (TALENs) and previously 
characterized‘, also showed a similar increase in contact duration, but 
no change in contact formation (Extended Data Fig. 6d, e), suggesting 
that RAB7 GTP hydrolysis induced by TBC1D15 does not regulate 
contact formation, but rather regulates contact duration by promoting 
contact untethering upon GTP hydrolysis. 

Contact untethering was further dependent on the mitochondrial 
localization of TBCD15, as expression of a FIS1(LA) mutant that can- 
not recruit TBC1D15 to mitochondria” (Extended Data Fig. 5f) also 
induced abnormally enlarged lysosomes that contacted mitochondria 
(Fig. 3f), resulting in an increase in the duration and number of mito- 
chondria-lysosome contacts (Fig. 3f, g and Extended Data Fig. 6f). 
Consistent with these findings, FIS1 ~'~ HCT116 cells?4 also showed 
similar increases in contact duration and number (Extended Data 
Fig. 6g, h). However, localization of TBC1D15, FIS1 or RAB7 was not 
restricted to or concentrated at mitochondria—lysosome contact sites 
(Fig. 2b and Extended Data Fig. 6i,j). Together, these results suggest 
that RAB7 GTP hydrolysis is regulated at mitochondria—lysosome 
contacts by the GAP activity of TBC1D15, which is recruited to mito- 
chondria by FIS1. Inhibition of RAB7 GTP hydrolysis leads to both 
defective lysosomal morphology and mitochondria-lysosome contacts 
that are unable to untether, and consequently remain in contact for a 
longer duration. 

Finally, we investigated whether mitochondria—lysosome contacts 
also regulate mitochondrial dynamics. Time-lapse confocal microscopy 
showed that mitochondria underwent fission events at an average of 
1.44 events per min in live HeLa cells. Unexpectedly, sites of mito- 
chondrial fission were predominantly marked by a LAMP1 vesicle 
(yellow arrow) before the fission event (white arrows) (Fig. 4a—c, 
Extended Data Fig. 7a—c, Supplementary Videos 5, 6). LAMP 1 vesi- 
cles contacted mitochondria at 81.5% of mitochondrial fission sites 
(n= 44/54 events from 18 cells), which was significantly greater than 
expected by random chance (12.6%; ***P <0.0001, Fisher’s exact 
test; Fig. 4d) and greater than the percentage of contacts made by 
other vesicles such as early endosomes (GFP-EEA1) or peroxisomes 
(mEmerald—peroxisome) (<20% of fission events) (Fig. 4e). LAMP 1 
vesicles also localized to mitochondrial fission events at similar rates 
in other cell types including H4 neuroglioma, HEK293 and HCT116 
cells (Extended Data Fig. 7d-g) and upon induction of mitochondrial 
fragmentation using actinomycin D, staurosporine (STS) or carbonyl 
cyanide m-chlorophenylhydrazone (CCCP) (Extended Data Fig. 8a—d). 
Mitochondrial fission events marked by lysosomes were also positive 
for mCherry-DRP1 oligomerization (Extended Data Fig. 9a) and endo- 
plasmic reticulum tubules labelled with the endoplasmic reticulum 
markers mCherry-ER (100%; n = 54/54 events from 16 cells; Extended 
Data Fig. 9b, c), BFP-KDEL (100%; n = 24/24 events from 13 cells) or 
GFP-SEC618 (100%; n= 11/11 events from 11 cells), demonstrating 
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Figure 3 | Mitochondrial recruitment of TBC1D15, a RAB7 GAP, by 
FIS1 drives RAB7 GTP hydrolysis to promote mitochondria-lysosome 
contact untethering. a—c, Representative time-lapse images of stable 
mitochondria-lysosome contacts (yellow arrows) lasting more than 

300s in living HeLa cells expressing mApple-TOM20 (mitochondria), 
LAMP1-mGFP (lysosome) and TBC domain mutants TBC1D15(D397A) 
or TBC1D15(R400K) lacking GAP activity. d, Quantification of minimum 
duration of mitochondria—lysosome contacts, which are increased by 
TBC domain mutants compared to wild-type TBC1D15 (n= 34 events 
from 12 cells, wild-type; n = 38 events from 10 cells, D397A; n = 36 

events from 11 cells, R400K). *P=0.0404, ***P = 0.0002; ANOVA with 
Tukey’s post-hoc test. e, Duration frequencies of mitochondria-lysosome 
contacts. f, g, Expression of a FIS1(LA) mutant (unable to recruit 
TBC1D15 to mitochondria) leads to an increase in the minimum duration 
of mitochondria—lysosome contacts compared to wild-type FIS1 (n= 45 
events from 9 cells per condition). *P = 0.049, unpaired two-tailed t-test. 
Data are means +s.e.m. Scale bars, 1 1m (a, b); 0.5,1m (c); 54m (f); 1 um 
(f, insets). 


that mitochondria—lysosome contacts mark the sites of DRP1- and 
endoplasmic reticulum-positive mitochondrial fission events. 

As RAB7 GTP hydrolysis regulates mitochondria—lysosome con- 
tacts, we investigated whether it also regulates mitochondrial fission. 
Expression of RAB7(Q67L) markedly reduced the rate of mitochondrial 
fission (Fig. 4f), resulting in mitochondria that did not undergo 
fission over time (Extended Data Fig. 10a). In addition, both the GAP 
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Figure 4 | Mitochondria-lysosome contacts mark sites of mitochondrial 
fission regulated by RAB7 GTP hydrolysis. a, b, Representative time- 
lapse images of lysosomes contacting mitochondria at the site 

of mitochondrial division (yellow arrow) before fission (white arrows) 

in living HeLa cells expressing LAMP1-mGFP (lysosomes) and 
mApple-TOM20 (mitochondria) (n = 62 events from 23 cells). 

c, Linescan corresponding to Fig. 4a showing a lysosome contacting a 
mitochondrion pre-fission (yellow arrow; top) and remaining in contact 
post-fission (yellow arrow; middle) after the mitochondria has divided 
into two daughter mitochondria (grey arrows; middle). d, e, Percentage 
of mitochondrial division events marked by LAMP! vesicles in living 
HeLa cells expressing LAMP1-mGFP (lysosomes) and mApple-TOM20 
(mitochondria) (n = 54 events from 18 cells). Significantly more events 
were marked by LAMP1 vesicles (81.5%) than expected by random 
chance (12.6%; ***P < 0.0001, Fisher’s exact test), or by early endosomes 
(GFP-EEA1) (n= 45 events from 17 cells; ***P < 0.0001) or peroxisomes 
(mEmerald-peroxisome) (n= 49 events from 17 cells; ***P < 0.0001). 
f-h, RAB7(Q67L) GTP-hydrolysis deficient mutant (n= 10 cells, RAB7; 
n= 13 cells, RAB7(Q67L); ***P =0.0008), TBC1D15 GAP mutants 
(D397A or R400K) (n= 13 cells per condition; *P = 0.451, ***P=0.001) 
or FIS1 (LA) mutant (unable to bind TBC1D15) (n= 19 cells, FISI(WT); 
n= 18 cells, FIS1(LA); **P = 0.0027) lead to decreased rates of 
mitochondrial fission. Data are means + s.e.m. ANOVA with Tukey’s post- 
hoc test (e, g), unpaired two-tailed t-test (f, h). Scale bars, 0.5 41m (a, b). 


mutants TBC1D15(D397A) and TBC1D15(R400K) (Fig. 4g, Extended 
Data Fig. 10b-e), and FIS1(LA), which disrupts TBC1D15 mito- 
chondrial recruitment (Fig. 4h), markedly reduced mitochondrial 
fission rates. However, for the few fission events that did occur, the 
percentage of mitochondrial fission marked by lysosomes or endo- 
plasmic reticulum was not altered by RAB7(Q67L) (Extended Data 
Fig. 10f, g) or TBC1D15 mutants (Extended Data Fig. 10h, i), further 
confirming that the majority of fission events are positive for lyso- 
somes and endoplasmic reticulum. Moreover, inhibition of RAB7 
GTP hydrolysis by RAB7(Q67L) or TBCID15 GAP mutants reduced 
the percentage of cells with normal mitochondrial networks that were 
not hypertethered or overly elongated (Extended Data Fig. 10j-l). 
Thus, mitochondrial TBC1D15 recruited by FIS1 promotes RAB7 
GTP hydrolysis at mitochondria-lysosome contacts to regulate both 
lysosomal morphology and mitochondrial fission. 
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In summary, we propose that mitochondria—lysosome contacts are 
regulated in two steps: formation and stabilization of contacts pro- 
moted by lysosomal GTP-bound RAB7, followed by contact unteth- 
ering by TBC1D15, a RAB7 GAP recruited to mitochondria by FIS1, 
which drives RAB7 GTP hydrolysis at contact sites and results in disso- 
ciation of GDP-bound RAB7 from the membrane, which can no longer 
maintain stable contacts. 

In addition, our work suggests that mitochondria—lysosome con- 
tacts regulate at least two important aspects of mitochondrial and 
lysosomal dynamics. First, lysosomal RAB7 hydrolysis is regulated by 
mitochondrial TBC1D15, providing a mechanism for mitochondria to 
modulate lysosomal dynamics by shutting down active RAB7, which 
regulates lysosomal transport, fusion and maturation!. Of note, the 
distance between TBC1D15’s mitochondrial FIS1-binding site and 
its TBC GAP domain for driving lysosomal RAB7 GTP hydrolysis 
is sufficient to span the distance (about 10nm) between membranes 
at mitochondria—lysosome contact sites. This ability to regulate Rab 
GTP-GDP cycling on the opposing membrane of a target organelle may 
be similar to that proposed for GEF activation of the Golgi-localized RAB 
GTPase YPT1P by the TRAPPI complex on endoplasmic reticulum- 
derived COPII-coated vesicles”. 

Second, mitochondria—lysosome contacts mark sites of mito- 
chondrial fission, conversely allowing lysosomal RAB7 to regulate 
mitochondrial dynamics. Previous studies examining the role of 
TBCI1D15 in regulating mitochondrial morphology at steady state**?> 
and that of FIS1 in regulating the mitochondrial fission machinery 
have been controversial. Although our data suggest that both TBC1D15 
and FIS1 indirectly regulate mitochondrial fission events via lysosomal 
RAB7 GTP hydrolysis, further work examining their mechanistic role 
in this process will be important. As membrane contact sites mediate 
multiple forms of inter-organelle communication’*”’, we hypothesize 
that mitochondria—lysosome contacts also function as platforms for 
metabolic exchanges between the two organelles. Thus, future studies 
of additional roles and protein tethers involved at these contacts will 
provide valuable insight into cellular organization and the pathogen- 
esis of multiple diseases linked to both mitochondrial and lysosomal 
dysfunction? **”. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. For studies involving 
multiple different experimental conditions in the same cell line, studies were per- 
formed on cells originating from the same cell line batch and randomly assigned 
experimental conditions for transfection. For preliminary analyses, researchers 
who were either involved or not involved in the study were asked to examine 
blinded samples for biological effects. 

Reagents. The following plasmids were obtained from Addgene: LAMP1-mGFP 
was a gift from E. Dell’Angelica (Addgene #34831)*!, LAMP1-RFP was a gift 
from W. Mothes (Addgene #1817)°*, BFP-KDEL, mito-BFP, mCherry-Drp1 and 
mCherry-RAB7A were gifts from G. Voeltz (Addgene #49150, #49151, #49152, 
#61804)'?3, EGFP-LC3 was a gift from K. Kirkegaard (Addgene #11546)*4, GFP- 
DFCP1 was a gift from N. Mizushima (Addgene #38269), pAc-GFPC1-Sec618 
was a gift from T. Rapoport (Addgene #15108), pCMV3-SMAC-HA-eGFP was a 
gift from R. Kahn (Addgene #67489), mVenus C1 was a gift from S. Vogel (Addgene 
#27794)°°, pKaznCMV-mClover3-mRuby3 was a gift from M. Lin (Addgene 
#74252)”, EGFP-RAB7A(WT) and EGFP-RAB7A(Q67L) were gifts from 
Q. Zhong (Addgene #28047, #28049)°8, mTagBFP2-Lysosomes-20”, mApple- 
TOMM20-N-10, mEmerald~TOMM20-C-10, DsRed2-Mito-7, mCherry- 
ATG5-C-18, mEmerald~ATG12-N-18, mCherry-ER-3, mEmerald—Peroxisome-2 
and pmTurquoise2-N1 were gifts from M. Davidson (Addgene #55308, #54955, 
#54281, #55838, # 54995, # 54003, #55041, #54228, #60561) and GFP-EEA1 wild 
type was a gift from S. Corvera (Addgene #42307)*°. N-terminal HA-tagged 
TBCID15 plasmids (wild-type, D397A, R400K and A231-240) and Flag-FIS1 
(wild-type and LA mutant) were gifts from N. Ishihara’>*!, YEP-TBC1D15 was a 
gift from R. Youle, ULK1-GFP was a gift from V. Deretic’”. The following rea- 
gents were also used: dextran cascade blue 10000MW (Thermo Fisher Scientific; 
D1976), LAMP1 rabbit antibody (Sigma-Aldrich, L1418), TOM20 mouse anti- 
body (BD biosciences, 612278), Flag rabbit antibody (Sigma-Aldrich, F7425), HA 
rabbit antibody (Cell Signaling, 3724S), HA mouse antibody (Cell Signaling, 
2367S) and Alexa fluorophore-conjugated secondary antibodies from Molecular 
Probes (Invitrogen). 

Cell culture and transfection. HeLa cells (gift from M. Schwake (ATCC)) and 
HEK293 cells (human embryonic kidney cell line 293FT (Life Technologies)) were 
cultured in Dulbecco’s modified Eagle's medium (DMEM) (Gibco; 11995-065) 
supplemented with 10% (vol/vol) FBS, 100 units per ml penicillin, and 100j1g/ml 
streptomycin. Wild-type, FISI~/~ and TBC1ID15~'~ HCT116 cells were gifts 
from R. Youle” and cultured in McCoy’s 5A with L-glutamine (ATCC 30-2007) 
supplemented with 10% (vol/vol) FBS, 100 units per ml penicillin, and 100j1g/ml 
streptomycin and nonessential amino acids. H4 neuroglioma cells*’ were cultured 
in Optimem + 5% FBS, 200j1g/ml geneticin and hygromycin and 1% penicillin/ 
streptomycin (Life Technologies), and treated with 1 g/ml doxycycline (Sigma) 
for 3 days. All cells were maintained at 37°C in a 5% CO? incubator and previously 
verified by cytochrome c oxidase subunit I (COI) and short tandem repeat (STR) 
testing, and were tested and found negative for mycoplasma contamination. Cells 
were transfected using Lipofectamine 2000 (Invitrogen). Dextran blue was used 
at 1 mg/ml and pulsed via incubation in medium for 15 min and chased for 4h, 
resulting in 95% of LAMP 1-positive vesicles containing dextran blue by this time 
point. For drug treatments, live cells were imaged while being treated for 20 min 
with actinomycin D (10\1M) (Sigma-Aldrich; A9415), STS (11M) (Sigma-Aldrich; 
$6942) or CCCP (201M) (Sigma-Aldrich C2759). For live imaging, cells were 
grown on glass-bottomed culture dishes (MatTek; P35G-1.5-14-C). 
Immunofluorescence. Cells were plated on coverslips and fixed in 3% (vol/vol) 
paraformaldehyde for 15 min and permeabilized with 2% BSA and 0.1% saponin. 
Fixed cells were incubated in primary antibody for 1 h, washed three times for 
5 min each, incubated in secondary antibody for 1h, washed three times for 5 min 
each, and mounted on glass slides with fluorescent mounting medium (Dako). 
Confocal microscopy. All non-FRET confocal images were acquired on a Nikon 
AIR laser scanning confocal microscope with GaAsp detectors using a Plan 
Apo ) 100x 1.45 NA oil immersion objective (Nikon) using NIS-Elements 
(Nikon). Live cells were imaged in a temperature-controlled chamber (37°C) at 
5% CO; at 1 frame every 2-3 s. Dual-colour videos were acquired as consecutive 
green-red images, and tricolour videos were acquired as consecutive green—-red- 
blue images. 

Electron microscopy. For electron microscopy (EM), cells were grown on coverslips 
and fixed in a mixture of 2.5% glutaraldehyde and 2% paraformaldehyde in 0.1 M 
cacodylate buffer for 2-24h at 4°C. After post-fixation in 1% osmium tetroxide 
and 3% uranyl acetate, cells were dehydrated in an ethanol series, embedded in 
Epon resin and polymerized for 48h at 60°C. Ultrathin sections were made using 
a UCT ultramicrotome (Leica Microsystems) and contrasted with 4% uranyl 
acetate and Reynolds’s lead citrate. Samples were imaged using a FEI Tecnai 
Spirit G2 transmission electron microscope (FEI) operated at 80kV. Images 
were captured with an Eagle 4k HR 200kV CCD camera. For correlative light 


electron microscopy, cells were grown on gridded glass-bottom culture dishes 
(MatTek; P35G-1.5-14-CGRD) and incubated for 45 min with LysoTracker Red 
(21M) (Thermo Fisher Scientific) before EM fixation. Fixed cells were imaged on 
the Nikon AIR laser scanning confocal microscope for LysoTracker staining using 
Z-stacks with step sizes of 0.2 1m as described above, and subsequently processed 
and imaged for EM as described above. 

Structured illumination microscopy. Structured illumination microscopy (SIM) 
super-resolution images were taken on a Nikon N-SIM system with a 100 oil 
immersion objective lens, 1.49 NA (Nikon). Images were captured using Nikon 
NIS-Elements and reconstructed using slice reconstruction in NIS-elements. 
Images for live cell imaging (live N-SIM) were taken at a single Z-plane, while 
images of fixed cells for 3D N-SIM were taken using Z-stacks with step sizes 
of 0.2 um. Cells used for live cell imaging were maintained in a temperature- 
controlled chamber (37°C) at 5% CO) in a TokaiHit stagetop incubator. 

FRET pair generation, imaging and analysis. The outer mitochondrial mem- 
brane (TOM20-Venus) and lysosomal membrane (LAMP 1-mTurquoise2) FRET 
pair was generated using mVenus C1 (Addgene #27794) and pmTurquoise2-N1 
(Addgene #60561). mRuby3 and RAB7a(Q67L)-mRuby3 were generated using 
mRuby3 obtained from pK,snCMV-mClover3—mRuby3 (Addgene #74252)°”. For 
FRET experiments, HeLa cells were plated on 35-mm 4-chamber glass-bottomed 
dishes (Cellvis) at a density of 40,000 cells per well. The following day, cells were 
transfected using lipofectamine with FRET pairs (TOM20-Venus and LAMP1- 
mTurquoise2) along with mRuby, RAB7a(WT)-mCherry or RAB7a(Q67L)- 
mRuby3. Images of live HeLa cells were acquired using a Nikon Spinning disk 
confocal microscope using 20x (for FRET intensity calculations) and 60x objec- 
tives (for representative time-lapse images) at excitation wavelengths of 445 nm, 
515nm, and 561 nm for mTurquoise2, Venus, and mCherry/mRuby3, respectively, 
in a temperature-controlled chamber (37°C) at 5% CO using NIS-Elements 
(Nikon). NIS-Elements (Nikon) was used for FRET analysis to calculate sensitized 
emission FRET (SE-FRET) and to unbiasedly generate regions of interest (ROI) by 
tracing individual cells in the red fluorescence view. A total of n = 200 cells were 
analysed per condition for RAB7a(WT) and RAB7a(Q67L) and the FRET intensity 
was normalized to average SE-FRET values for RAB7a(WT). 

Image analysis. Mitochondrial fission events were defined as those that showed 
clear division of a single mitochondrion into two distinct daughter mitochondria 
that moved independently of one another after division. The expected probability 
that a LAMP1 vesicle would be at the site of a mitochondrial division event by 
random chance was calculated as the density of LAMP1 vesicles in the cytosol 
from n= 26 living cells, using ImageJ (National Institutes of Health (NIH)). 
Mitochondria-lysosome contacts imaged in living cells were categorized as those 
that showed mitochondria and lysosomes in close proximity (<0.1 1m) for >10 s 
in time-lapse images. All contacts analysed for the minimum duration of con- 
tacts were those that had already formed at the beginning of the video. The mini- 
mum duration of contact in HeLa cells was quantified as the time before contact 
termination and dissociation (mitochondria and lysosomes detaching from one 
another) over a 5-min (300-s) video. Any contacts that lasted throughout the entire 
5-min video and were still in contact by the end of the video were categorized as 
300s in bar graphs and as >5 min in histograms for the minimum duration of 
mitochondria-lysosome contacts. The percentage of lysosomes in contacts was 
quantified as the percentage of vesicles that formed contacts (defined above) with 
mitochondria divided by the total number of vesicles in the region of interest. 
The minimum duration of contact in HCT116 cells was quantified from 
videos of >100s. Mitochondrial networks that did not contain overly elongated 
mitochondria (>10\1m length) or hyperfused or hypertethered mitochondria 
were classified as normal and scored per condition. The rate of mitochondrial 
fission was calculated per cell by quantifying the number of fission events in the 
entire cell from videos of > 100s. The distance between membranes and the length 
of mitochondria—lysosome contact sites were measured from EM images using 
ImageJ (NIH). Line scans were generated using ImageJ (NIH) and normalized 
per protein. 

Statistical analysis, graphing and figure assembly. Data were analysed using 
unpaired two-tailed Student t-test (for two datasets) or one-way ANOVA with 
Tukey’s post hoc test (for multiple datasets). Fisher’s exact test was used to compare 
the percentage of observed mitochondrial division events with mitochondria— 
lysosome contacts versus the percentage expected by random chance. Data pre- 
sented are means + s.e.m. (except in histograms). All statistical tests were justified 
as appropriate and were analysed from n> 9 cells (see text and figure legends for 
details) from n > 3 independent experiments (biological replicates) per condition. 
Statistics and graphing were performed using Prism 7 (GraphPad) software. All 
videos and images were assembled using Image] 1.51j8 (NIH). All final figures 
were assembled in Illustrator (Adobe). 

Data Availability. All data that support the findings of this study are included in 
the manuscript or are available from the authors upon reasonable request. 
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Extended Data Figure 1 | Correlative light electron microscopy 

and 3D structured illumination microscopy of mitochondria- 
lysosome contacts. a—c, Representative electron microscopy images of 
mitochondria (M) and lysosome (L) contacts (yellow arrows) in untreated 
HeLa cells (insets shown on right) (n =55 examples from 20 cells). 

d, e, Representative correlative light electron microscopy and confocal 
images of HeLa cells (from n= 14 images from 6 cells) incubated with 
LysoTracker Red to label lysosomes or late endosomes (red arrows) that 
contain electron-dense lumen with irregular content and/or multilamellar 
membrane sheets (d, see insets on right), and form a stable membrane 


contact site with mitochondria (e, yellow arrows; see inset on right), while 
simultaneously forming contact sites with the endoplasmic reticulum 

(e, purple arrows). Early endosomes lacking electron-dense lumen are 
LysoTracker-negative (d, blue arrows). f, Representative structured 
illumination microscopy (N-SIM) images of mitochondria-lysosome 
contacts (yellow arrows) in fixed HeLa cells stained for endogenous 
LAMP1 (lysosomes) or TOM20 (mitochondria) and imaged in Z-stacks 
showing contacts extending more than 200 nm in the Z-plane (n= 210 
examples from 26 cells). Scale bars, 200 nm (a-d); 100 nm (a-d, insets on 
right; e, left, middle); 50 nm (e, right); 500 nm (f). 
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Extended Data Figure 2 | Characterizing mitochondria-lysosome 
contacts in living cells. a-d, Representative images of mitochondria- 
lysosome contacts (lasting more than 10s) in living HeLa cells expressing 
LAMP1-mGFP (lysosomes) and mApple-TOM20 (mitochondria) (n = 23 
cells). a, Examples of small LAMP!1 vesicles (vesicle diameter <0.5|1m) 
contacting mitochondria. b, Examples of larger LAMP1 vesicles (vesicle 
diameter >1|1m) contacting mitochondria. c, Examples of a single LAMP1 
vesicles contacting multiple mitochondria. d, Examples of multiple 
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LAMP! vesicles contacting a single mitochondrion. e, Representative 
images of contacts (yellow arrows) in fixed HeLa cells stained for 
endogenous LAMP 1 (green) and TOM20 (red) (n= 341 examples from 

25 cells). f, g, Representative images of living HeLa cells (n= 23 cells) 
expressing LAMP1-mGFP (lysosomes) and mApple~-TOM20 (outer 
mitochondrial membrane) with corresponding linescans showing a 
mitochondria-lysosome contact at close proximity (f), distinct from 
lysosomal engulfment of mitochondrial TOM20 (g). All scale bars, 0.5 jim. 
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Extended Data Figure 3 | Structured illumination microscopy and 
FRET imaging of mitochondria-lysosome contacts in living cells. 
a-c, Representative N-SIM images (a, b) of mitochondria—lysosome 
contacts (yellow arrows) in living HeLa cells (n = 43 examples from 10 
cells) expressing LAMP1-mGFP (lysosomes) and mApple-TOM20 
(mitochondria) and quantification of duration of mitochondria- 
lysosome contacts from N-SIM time-lapse images (c). d, Model of newly 
generated FRET pairs targeted to the outer mitochondrial membrane 
(TOM20-Venus) and the lysosomal membrane (LAMP 1-mTurquoise2). 


e, Representative time-lapse images of a living HeLa cell (n = 200 cells) 
expressing FRET pairs (TOM20-Venus, LAMP 1-mTurquoise2) and 
RAB7a(Q67L)-mRuby3 demonstrating preferentially increased SE-FRET 
signal over 60s at the interface between mitochondria and lysosomes 
(white arrows). f, Quantification of normalized SE-FRET intensity per 
cell in conditions expressing wild-type RAB7a or RAB7a(Q67L) (n= 200 
cells per condition) showing an approximately twofold increase in cells 
expressing RAB7a(Q67L). Data are means + s.e.m. ***P < 0.0001, 
unpaired two-tailed t-test (f). Scale bars, 441m (a); 1 jm (b, e). 
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Extended Data Figure 4 | See next page for caption. 
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Extended Data Figure 4 | Mitochondria-lysosome contacts are distinct 
from mitochondria-derived vesicles and mitophagy. a—c, Representative 
images (a) of living HeLa cells expressing LAMP1-RFP (lysosomes), 
mito-BFP (mitochondrial matrix) and SMAC-EGFP (mitochondrial 
intermembrane space), and corresponding linescans (b, c) showing 

that mitochondrial intermembrane space and matrix proteins do not 
undergo bulk transfer into lysosomes at contacts (yellow arrows) (n =57 
events from 12 cells). d, e, Representative images (d) in a living HeLa cell 
expressing mApple-TOM20 (mitochondrial outer membrane), mito-BFP 
(mitochondrial matrix) and LAMP1-mGFP (lysosomes) and linescan 

(e, corresponding to top panel in d) showing that mitochondria that form 
contacts with lysosomes (yellow arrows) are positive for mitochondrial 
matrix protein mito-BFP and are not TOM20-positive MDVs (n= 104 
events from 23 cells). f, Representative linescan in a living HeLa cell 
expressing mEmerald-TOM20 (mitochondrial outer membrane), 
DsRed2-Mito (mitochondrial matrix) and mBFP2-Lys (lysosomes) 
showing that mitochondria that form contacts with lysosomes are positive 
for mitochondrial matrix protein DsRed2-mito and are not TOM20- 


positive MDVs (n= 94 events from 16 cells). g-i, Representative images 
(g) in a living HeLa cell expressing mApple-TOM20 (outer mitochondrial 
membrane), LAMP1-mGFP (lysosomal membrane) and fluid-phase 
marker dextran blue pulse-chased into the lysosomal lumen, and 
corresponding linescans (h, i) showing that lysosomal luminal contents 
(blue) do not undergo bulk transfer into mitochondria at contacts (yellow 
arrows) (n= 66 events from 18 cells). j, Representative images in a living 
HeLa cell expressing LAMP1-RFP (lysosomes), mito-BFP (mitochondrial 
matrix) and EGFP-LC3 (autophagosome) showing that mitochondria 
that form contacts with lysosomes (yellow arrows) are not engulfed by 
autophagosomes (not undergoing mitophagy) (n = 142 events from 

17 cells). k, Autophagosome biogenesis proteins (ULK1-GFP, mCherry- 
ATG5, mEmerald-ATG12, GFP-DFCP1 and EGFP-LC3) do not mark 
sites of mitochondria-lysosome contacts in living cells (number of events 
analysed in n= 14 cells (ULK1), n=17 cells (ATG5, ATG12, LC3) or 

n= 13 cells (DFCP1), top; quantification, bottom). Mitochondria (M) and 
lysosomes (L) are indicated in linescans. Data are means + s.e.m. Scale 
bars, 0.5 1m (a); 1 jum (d, g, j). 
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Extended Data Figure 5 | FIS1 recruits TBC1D15 to mitochondria. localization of YFP-TBC1D15 to mitochondria is greatly decreased by 
a-e, Representative images and quantification of localization of HA- the Flag-FIS1(LA) mutant (which cannot bind TBC1D15) as compared 
TBCI1D15 to mitochondria (stained with endogenous TOM20) in fixed to wild-type Flag—FIS1 (n= 290 cells, FIS1; n = 281 cells, FIS1(LA)). 
HeLa cells showing that mitochondrial localization is not disrupted by *** P< 0.0001, unpaired two-tailed t-test. g, Examples of HA~-TBC1D15 


TBC1D15 GAP mutants (D397A or R400K) but is disrupted by mutating GAP mutants (D397A and R400K) or FIS1-binding mutant (A231-240) 
the FIS1-binding site of TBC1D15 (A231-240) (n= 293 cells, WT; n=228 inducing enlarged lysosomes (white arrows) (LAMP1-mGFP) not 

cells, D397A; n= 181 cells, R400K; n = 379 cells, A231-240). A231-240 observed in cells expressing wild-type HA-TBC1D15 (n= 293 cells, WT; 
versus WT (*P=0.0178), D397A (*P=0.0131), and R400K (*P=0.0112), n= 228 cells, D397A; n= 181 cells, R400K; n = 379 cells, A231-240). Data 
ANOVA with Tukey’s post-hoc test. f, Quantification showing that are means + s.e.m. Scale bars, 101m (a-d, g); 1 {um (a-d, insets). 
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Extended Data Figure 6 | See next page for caption. 
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Extended Data Figure 6 | Recruitment of TBC1D15 by FIS1 

to mitochondria promotes mitochondria-lysosome contact 
untethering. a, b, Representative time-lapse images of stable 
mitochondria—lysosome contacts (yellow arrows) for over 100s before 
untethering (white arrow) in living HeLa cells expressing mApple-TOM20 
(mitochondria), LAMP1-mGFP (lysosome) and the RAB7 GAP mutant 
TBC1D15(D397A) (n= 38 events from 10 cells). c, TBC domain mutants 
TBC1D15(D397A) and TBC1D15(R400K), which lack GAP activity, 

do not alter the percentage of lysosomes in contacts (n = 12 cells per 
condition), as compared to wild-type TBC1D15 (N.S., not significant). 
d, e, TBC1D15~'~ HCT116 cells have increased duration (d, n= 18 
events from 6 cells, WT; 1 = 16 events from 7 cells, TBC1D15~‘~) but 

no change in the number of mitochondria-lysosome contacts (e, n= 15 
cells, WT; n= 14 cells, TBCID15~'~) compared to wild-type HCT116 


LETTER 


cells (*P < 0.0491, N.S., not significant). f, Expression of the Flag- 
FIS1(LA) mutant (unable to bind TBC1D15) increases the percentage 
of lysosomes in mitochondria-lysosome contacts compared to wild- 
type FIS1 in living HeLa cells (n = 18 cells, FIS1; n = 16 cells, FIS1(LA); 
*P<0.0117). g, h, FIS1 ~'~ HCT116 cells have an increased duration 

(g, n= 18 events from 6 cells, WT; n= 14 events from 6 cells, FIS1 =a) 
and number of mitochondria—lysosome contacts (h, n= 15 cells, WT; 
n= 13 cells, FISI~'~) compared to wild-type HCT116 cells (*P < 0.0442, 
**P < 0.0001). i, j, Localization of HA-TBC1D15 (i, n= 293 cells) and 
Flag-FIS1 (j, n = 272 cells) to mitochondria in fixed HeLa cells is not 
restricted to mitochondria-lysosome contacts. Data are means + s.e.m. 
ANOVA with Tukey’s post-hoc test (c), unpaired two-tailed t test (d-h). 
Scale bars, 0.5 41m (a); 1 jum (b, i (insets), j (insets)); 10 um (i, j). 
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Extended Data Figure 7 | Mitochondrial fission sites are marked 

by mitochondria-lysosome contacts in multiple cell types. 

a, b, Representative time-lapse images of lysosomes contacting 
mitochondria at site of mitochondrial fission (yellow arrow, top) 

before mitochondrial fission (white arrows, middle) in living HeLa 

cells expressing mGFP-LAMP1 (lysosomes) and mApple-TOM20 
(mitochondria) with corresponding linescans (right) showing lysosomes 
at the site of fission (yellow arrow; linescan) after mitochondrial 
division into two daughter mitochondria (grey arrows, linescan) (n = 62 
events from 23 cells). c, Electron microscopy image of mitochondria 
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(M) in contact (<30nm) with a lysosome (L; yellow arrows) at site of 
mitochondrial constriction in untreated HeLa cells (from n = 20 cells 
imaged). d-g, Lysosomes (yellow arrows in e-~g; mGFP-LAMP1) mark 
sites of mitochondrial fission (white arrows in e-g; mApple-TOM20) at 
similar rates (d) in living H4 neuroglioma, HEK293 and HCT116 cells as 
in HeLa cells by time-lapse confocal imaging (mn = 49 events from 10 cells, 
HeLa; n = 36 events from 13 cells, H4; n= 18 events from 9 cells, HEK293; 
n=9 events from 6 cells, HCT116). Data are means +s.e.m. N.S., not 
significant, ANOVA with Tukey’s post-hoc test. Scale bars, 1 zm 

(a, b, e-g, insets); 200 nm (c); 2.5 1m (e-g). 
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Extended Data Figure 8 | Mitochondria-lysosome contacts mark with actinomycin D (a), STS (b) or CCCP (c) (n=49 events from 10 cells, 
sites of mitochondrial fission upon induction of mitochondrial control; n= 29 events from 14 cells, actinomycin D; n = 36 events from 
fragmentation. a—d, Lysosomes (yellow arrows; mGFP-LAMP1) mark 10 cells, STS; n = 49 events from 14 cells, CCCP). Data are means + s.e.m. 
sites of mitochondrial fission (white arrows; mApple-TOM20) at similar N.S., not significant, ANOVA with Tukey’s post-hoc test. Scale bars, 5 4m 
rates (d) in untreated living HeLa cells as in cells treated for up to 20 min (a-c); 11m (a-c, insets). 
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Extended Data Figure 9 | Mitochondrial fission sites marked by time-lapse images shown in c) of a lysosome (mBFP2-Lys) contacting 


lysosomes are positive for DRP1 and endoplasmic reticulum tubules. mitochondria (mEmerald-TOM20) at the site of mitochondrial division 
a, Representative time-lapse images of a lysosome (mBFP2-Lys) (yellow arrow) before fission (white arrows) in a living HeLa cell 
contacting mitochondria (mEmerald~-TOM20) at the site of mitochondrial showing an endoplasmic reticulum tubule (mCherry-ER) at the site of 
division (yellow arrow) before fission (white arrows) in a living HeLa cell mitochondrial division (n = 54 events from 16 cells). Scale bars, 1 jum 
showing mCherry—DRP1 oligomerization at the site of mitochondrial (a, c); 54m (b). 
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Extended Data Figure 10 | See next page for caption. 
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Extended Data Figure 10 | Regulation of mitochondrial network 
dynamics by RAB7 GTP hydrolysis. a, Examples of mitochondria not 
undergoing fission for more than 120s in living HeLa cells expressing 
mApple-TOM20 (mitochondria) and RAB7(Q67L)-GFP (n= 13 cells). 
b, c, Examples of mitochondria undergoing fission (white arrows) after 
36 s in living HeLa cells expressing mApple-TOM20 (mitochondria) and 
wild-type TBC1D15 (n= 13 cells). d, e, Examples of mitochondria not 
undergoing fission for more than 240s in living HeLa cells expressing 
mApple-TOM20 (mitochondria) and GAP mutants TBC1D15(D397A) 
(d) or TBC1D15(R400K) (e) (n = 13 cells per condition). f-i, The 
percentage of mitochondrial fission sites marked by lysosomes (mGFP- 
LAMP; f, h) or endoplasmic reticulum (mCherry-ER; g, i) is not 
disrupted by the RAB7(Q67L) GTP-hydrolysis-deficient mutant 

(f, g; n= 12 events from 15 cells) or by TBC1D15 GAP mutants (D397A or 


R400K) (h, i; 7 = 22 events from 10 cells, WT; n= 17 events from 19 cells, 
D397A; n= 27 events from 22 cells, R400K). j-I, Examples of RAB7(Q67L) 
and HA~TBC1D15 GAP mutants (D397A and R400K) inducing elongated 
mitochondria (j, yellow arrows; >10,1m length) compared to control cells, 
and quantification of RAB7(Q67L) (k; *P=0.0321) and HA-TBC1D15 
GAP mutants (D397A and R400K) (1; *P =0.0297, **P=0.0051) 

leading to decreased percentages of cells with normal mitochondrial 
networks (no elongated mitochondria >10,1m length or hyperfused or 
tethered networks) (n= 47 cells, RAB7; n= 72 cells, RAB7(Q67L); n= 88 
cells, TRC1D15 WT; n= 168 cells, TBC1D15(D397A); n= 132 cells, 
TBC1D15(R400K)). Data are means + s.e.m. N.S., not significant; ANOVA 
with Tukey’s post-hoc test (h, i, 1), unpaired two-tailed t-test (f, g, k). Scale 
bars, 0.5 1m (a); 11m (b-e); 101m (j). 
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Structural basis for DNMT3A-mediated de novo 


DNA methylation 


Zhi-Min Zhang!+*, Rui Lu?**, Pengcheng Wang“, Yang Yu*, Dongliang Chen’, Linfeng Gao’, Shuo Liu‘, Debin Ji’, 
Scott B Rothbart**®, Yinsheng Wang*°, Gang Greg Wang?’s & Jikui Song!4s 


DNA methylation by de novo DNA methyltransferases 3A 
(DNMT3A) and 3B (DNMT3B) at cytosines is essential for 
genome regulation and development!”. Dysregulation of this 
process is implicated in various diseases, notably cancer. However, 
the mechanisms underlying DNMT3 substrate recognition 
and enzymatic specificity remain elusive. Here we report a 
2.65-angstr6ém crystal structure of the DNMT3A-DNMT3L-DNA 
complex in which two DNMT3A monomers simultaneously attack 
two cytosine-phosphate-guanine (CpG) dinucleotides, with the 
target sites separated by 14 base pairs within the same DNA duplex. 
The DNMT3A-DNA interaction involves a target recognition 
domain, a catalytic loop, and DNMT3A homodimeric interface. 
Arg836 of the target recognition domain makes crucial contacts 
with CpG, ensuring DNMT3A enzymatic preference towards CpG 
sites in cells. Haematological cancer-associated somatic mutations 
of the substrate-binding residues decrease DNMT3A activity, 
induce CpG hypomethylation, and promote transformation of 
haematopoietic cells. Together, our study reveals the mechanistic 
basis for DNMT3A-mediated DNA methylation and establishes its 
aetiological link to human disease. 

Mammalian DNA methylation is an important epigenetic mech- 
anism crucial for gene silencing and imprinting, X-inactivation, 
genome stability, and cell fate determination*. It is established mainly 
at CpG dinucleotides by the de novo methyltransferases DNMT3A 
and DNMT3B!”, and subsequently maintained by DNA methyl- 
transferase 1 (DNMT1) in a replication-dependent manner*. The 
enzymatic function of DNMT3A and DNMT3B is further regulated 
by DNMT3-like protein (DNMT3L) in germ and embryonic stem 
(ES) cells>-7. Deregulation of DNMT3A and DNMT3B is associated 
with various human diseases including haematological cancer®*"”. 
However, the molecular mechanisms underpinning DNMT3A- 
mediated methylation, especially substrate recognition and catalytic 
preference towards CpG, remain elusive. Here we generated a pro- 
ductive DNMT3A-DNMT3L-DNA complex using the C-terminal 
domains of DNMT3A and DNMT3L (Fig. 1a). The DNA molecule 
consists of a 10-mer central CpG-containing DNA strand annealed to 
an 11-mer 2’-deoxyzebularine (dZ)-containing strand (target strand), 
which results in a (CpG)-(dZpG) sequence context and allows for- 
mation of stable covalent DNMT3A-DNA complexes (Extended Data 
Fig. la, b). The crystal structure of the DNMT3A-DNMT3L in complex 
with 10/11-mer DNA, bound to cofactor by-product S-adenosyl-.- 
homocysteine (AdoHcy), was subsequently determined at 3.1 A reso- 
lution (Extended Data Fig. Ic). 

The structure of this DNMT3A-DNMT3L-DNA complex reveals 
a tetrameric fold arranged in the order of DNMT3L-DNMT3A- 


DNMT3A-DNMT3L, reminiscent of its DNA-free form!” (Extended 
Data Figs 1d and 2a). Notably, two DNA duplexes, each bound to one 
DNMT3A monomer, are separated by approximately 15 A, implying a 
total of 14-base-pair spacing between the two active sites of DNMT3A 
(Extended Data Fig. 1d). This finding prompted us to design a longer 
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Figure 1 | Structure of the DNMT3A-~DNMT3L tetramer in complex 
with a 25-mer DNA duplex containing two CpG sites. a, Domain 
architectures of DNMT3A and DNMT3 L, with the C-terminal domains 
marked with arrowheads. b, DNA sequence used for structural study. 

Z, zebularine. c, d, Ribbon (c) and surface (d) representations of 
DNMT3A-DNMT3L bound to DNA and AdoHcy. The zebularines 
anchored at the two active sites are 14 base pairs (bp) away and shown 
in expanded views, with hydrogen-bonding interactions depicted as 
dashed lines and the F, — F, omit map (pink) of Z5 or Z20’ contoured at 
the 3a level. 
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Figure 2 | Structure comparison of free and DNA-bound DNMT3A- 
DNMT3L tetramer. a, Schematic view of the intermolecular interactions 
between DNMT3A and DNA. The hydrogen-bonding, electrostatic, and 
van der Waals contacts are represented by red, blue, and black arrows, 
respectively. Water-mediated hydrogen bonds are labelled with the letter “W’. 
b, Structural overlay of free (grey) and DNA-bound (cyan) DNMT3A- 


DNA substrate involving a self-complementary 25-mer zebularine 
(Z)-containing DNA, with two (CpG)-(ZpG) sites across 14 base 
pairs (Fig. 1b). The structure of the DNMT3A-DNMT3L in complex 
with 25-mer DNA was determined at 2.65 A resolution (Extended 
Data Fig. 1c), revealing only one DNA duplex bending towards the 
DNMT3A-DNMT3L tetramer, with the two CpG sites simultaneously 
anchored by two DNMT3A monomers (Fig. 1c, d and Extended Data 
Fig. 2b). Our data thus support the notion that the two DNMT3A 
monomers can co-methylate two adjacent CpG dinucleotides in one 
DNA-binding event'”!’. Despite being crystallized under different con- 
ditions, both DNMT3A-~-DNMT3L-DNA complexes are well aligned 
with their DNA-free state, with a root-mean-square deviation of 0.87 A 
and 1.12 A over 790 and 826 Ca atoms, respectively (Extended Data 
Fig. 2a). Notably, in DNMT3A-DNMT3L-DNA complexes, zebular- 
ines are flipped out of the DNA helix and inserted deep into DNMT3A 
catalytic pockets, where they are covalently anchored by the catalytic 
cysteine C710 and hydrogen bonded to E756, R790, and R792 (Fig. 1c 
and Extended Data Fig. 1d). Because both structures reveal productive 
reaction states with consistent protein-DNA interactions, we focus on 
the structure of DNMT3A-DNMT3L in complex with 25-mer DNA 
for further analysis. 

DNMTS3A binding to DNA is mainly mediated by a loop from the 
target recognition domain (TRD) (residues R831-F848), the cata- 
lytic loop (residues G707-K721), and the homodimeric interface of 
DNMT3A, which together create a continuous DNA-binding surface 
(Figs 1d and 2a). Accordingly, these segments exhibit the most promi- 
nent structural changes upon DNA binding: the TRD loop lacked elec- 
tron density in the DNA-free structure of DNMT3A-DNMT3L!!!”, but 
became well defined upon DNA binding and penetrated into the DNA 
major groove for intermolecular contacts (Fig. 2a—c); additionally, the 
TRD loop is stabilized through hydrogen-bonding interactions with 
R882, the DNMT3A mutational hotspot among leukaemias” !®, and 
Q886 from an adjacent helix (Fig. 2c). Meanwhile, the catalytic loop 
residue V716 moves towards the DNA minor groove by approximately 
2A, intercalating into the DNA cavity vacated because of zebularine 
base flipping (Fig. 2d, e). Although no protein-DNA contact was 
observed for DNMT3L, two DNMT3L-contacting helices of DNMT3A 
are preceded by DNA-binding loops (Extended Data Fig. 2c), rein- 
forcing the notion that DNMT3L enhances DNMT3A functionality 
through stabilizing its DNA-binding sites’. 
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DNMT3L. The disordered TRD loops in free DNMT3A-DNMT3L are 
depicted as dotted lines. c, The TRD loop (blue) in DNA-bound DNMT3A 
is stabilized by hydrogen-bonding interactions (dashed lines) with R882 
and Q886. d, Structural overlay of the catalytic loop between free (grey) 
and DNA-bound (blue) DNMT3A. e, DNA intercalation by DNMT3A 
V716. 


Recognition of CpG dinucleotides by DNMT3A is mediated by both 
catalytic and TRD loops. In particular, guanine of the target strand, 
G6 (G19’), is specified by a hydrogen bond between its O06 atom and 
the Ne atom of R836 from the TRD loop, as well as water-mediated 
hydrogen bonds between its N7 atom and the Ne and O7 atoms of 
R836 and T834, respectively (Fig. 3a). Meanwhile, the catalytic loop 
approaches the minor groove, where the backbone carbonyl oxygen 
of V716 forms a hydrogen bond with the N2 atom of the unpaired 
guanine G5’ (G20) (Fig. 3b). Penetration of the catalytic loop also 
allows V716 and P718 to engage van de Waals contacts with the base 
of G6 (G19’), providing additional base-specific recognition (Fig. 3b). 
No protein interaction was associated with C6’ (C19) of the non-target 
strand, providing an explanation for the observation that DNMT3A 
does not discriminate hemimethylated against unmethylated DNA’. 
Formation of the DNMT3A-DNA complex is also supported by various 
protein-DNA interactions flanking CpG, which involve electrostatic 
and/or hydrogen-bonding interactions of the TRD residues (R831, 
1832, T835, N838, and K841), catalytic loop residues (N711, $714, 
and 1715) and DNMT3A-DNMT3A homodimeric interface residues 
(S881, R882, L883, and R887) with various DNA backbone or base sites 
(Fig. 2a and Extended Data Fig. 3a—f). These DNA-binding residues are 
highly conserved in DNMT3B (Extended Data Fig. 3g), suggesting a 
similar substrate engagement mechanism used by the DNMT3 family. 

To determine the roles for CpG-engaging residues R836 and V716 in the 
regulation of DNMT3A activity, we performed mutagenesis followed 
by enzymatic studies using CpG-, CpA-, or CpT-containing substrates 
(Fig. 3c and Extended Data Fig. 4a, b). First, wild-type DNMT3A 
(DNMT3A™") showed methylation efficiency for CpG-containing 
DNA more than 20-fold higher than for CpA- or CpT-containing DNA, 
confirming its well-known CpG specificity'*. By contrast, mutation 
of R836 to alanine (R836A) enhanced methylation of CpA- and CpT- 
containing DNA by 5.2- and 4.2-fold, respectively, but, as previously 
reported’, only led to slight change in CpG methylation. As a result, 
the relative CpG/CpA and CpG/CpT preference of the DNMT3AR®°4 
enzyme was reduced by 4.5- and 3.7-fold, respectively, supporting a 
role for R836 in substrate specificity determination. Consistent with 
these observations, we solved the structure of the DNMT3A8864_ 
DNMT3L-DNA complex, which lacks R836-mediated hydrogen bonds 
to CpG without causing overall structural alterations (Extended Data 
Fig. 4c). Mutation of V716 to glycine (V716G) abolished methylation 
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Figure 3 | DNMT3A-CpG interactions. a, b, Interactions of R836 (a) 
and V716-P718 (b) with one CpG site. The hydrogen bonds and water 
molecule are shown as dashed lines and a purple sphere, respectively. 
Nucleotides in the second CpG site are shown in parentheses. ¢, In vitro 
methylation for 40 min using DNMT3A“!-DNMT3L or DNMT3A8*6A_ 
DNMT3L complex (n = 3 biological replicates); c.p.m., counts per minute. 
WT, wild type. d, Composition of all called methylated cytosines (mC) 

in TKO ES cells reconstituted with DNMT3A? or DNMT3A8°4, 

A methylated cytosine is defined using a binomial distribution-based filter 
with FDR less than 1% (n = 21,678,839 for wild type and n= 25,272,362 for 
R836A). e, f, eRRBS revealing averaged CpG and non-CpG methylations, 
either at global levels (e) or at a representative major satellite DNA region 
(f), that are induced by DNMT3A! versus DNMT3A®**4 among three 
independent lines of TKO ES cells. Shown in e are box and whisker plots 
of 10-kb-bin-averaged methylation levels for each sequence context. 
Symbols +/—, forward/reverse strand. The box and whiskers depict the 
interquartile and 1.5 x interquartile ranges, respectively. g, Individual 
bisulfite sequencing detecting the methylation level of CG, CA, CT, or 

CC sites within a major satellite DNA site at chromosome 2 in TKO cells 
expressing DNMT3A™? or DNMT3A®* (7 = 4). Data are mean +s.d. 
Statistical analysis used two-tailed Student's t-test. ***P < 0.001. 


of all tested substrates (Extended Data Fig. 4d). These observations 
support the idea that R836-mediated CpG engagement contributes to 
substrate specificity whereas V716-mediated intercalation is essential 
for DNMT3A-mediated catalysis. The increased in vitro activity of 
DNMT3A®®%64 on CpA and CpT suggests that R836 might energet- 
ically influence the enzymology of DNMT3A, in addition to target 
recognition. In the case of CpG DNA, such an influence might be partly 
compensated by the R836-mediated hydrogen bonds, thereby ensuring 
the CpG specificity of DNMT3A. 

Next, we introduced comparable levels of DNMT3A, either wild type 
or the above CpG-engagement-defective mutants, into ES cells with 
compound knockouts of DNMT1, DNMT3A, and DNMT3B (TKO) He 
and detected a global increase in cytosine methylation after rescue with 
DNMT3A™? or DNMT3A®64, but not DNMT3AV7!6S (Extended 
Data Fig. 4e, f). Furthermore, genome-wide methylation profiling 
with enhanced reduced representation bisulfite sequencing (eRRBS), 
followed by calling of methylation using the previously described 
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binomial model and false discovery rate (FDR)-based threshold!718, 


revealed that, in TKO ES cells reconstituted with DNMT3AW 5, 58% 
and 42% of methylated cytosines were presented at CpG and non-CpG 
sites, respectively (Fig. 3d and Extended Data Fig. 5a—c); by contrast, 
such a distribution was reversed in cells expressing DNMT3A8864, 
with 31% and 69% of methylated cytosines found in CpG and non- 
CpG contexts (Fig. 3d and Extended Data Fig. 5b, c). Consistently, 
relative to wild-type controls, the absolute methylation levels were found 
to be decreased at CpG, but increased at CpA and CpC, sites among cells 
with DNMT3A**°*4, especially at sites showing intermediate-to-high 
levels of methylation (Fig. 3e and Extended Data Figs 5d and 6a). 
These changes were persistent among all chromosomes, at both DNA 
strands and over all annotated genes (Extended Data Fig. 6b-d), as 
exemplified by those detected at the major satellite DNA repeats 
(Fig. 3f) and gene-coding regions of Foxp1 and Dock1 (Extended Data 
Fig. 6e). Sanger bisulfite sequencing further validated eRRBS results at 
major satellite repeats in ES cells’? (Fig. 3g and Extended Data Fig. 7). 
DNMT3A’”!°¢ abolished both CpG and non-CpG methylations at 
major satellite DNA (Extended Data Fig. 7b-d). The above observa- 
tion that DNMT3A**4 decreases overall CpG methylations in TKO 
cells might be due to competition of non-CpG as a potential substrate 
for this mutant enzyme. Collectively, we demonstrate that engaging 
CpG by the R836 side chain ensures DNMT3A substrate specificity 
in cells. 

Notably, heterozygous mutation of DNMT3<A at its DNA-binding 
residues, such as $714, V716, P718, R792, T835, R836, N838, K841, 
and R882 (Figs 2a and 4a, b), occurs recurrently in haematological 
cancer®!0 and overgrowth syndrome”!. Although recent studies 
support a dominant-negative effect of the hotspot R882H mutation on 
DNMT3A-mediated methylation, possibly through affecting DNMT3A 
tetramerization?*~*°, our structural observation raises the possibility 
that interfering with the DNA binding via residue substitution also 
results in functional impairment of DNMT3A during pathogenesis. 
Indeed, in vitro enzymatic assays showed the significantly reduced 
activity for all tested DNA-binding mutants, with the most pronounced 
effect observed for V716D, R792H, and K841E (Fig. 4c and Extended 
Data Fig. 8a—d). Consistently, expression of these three mutants in 
TKOES cells failed to restore global DNA methylation (Extended Data 
Fig. 8e, f). It is worth noting that, although DNMT3A®®°W exhibited 
modestly reduced overall activities (Extended Data Fig. 8c, f), its activi- 
ties for non-CpG methylations were found to be significantly increased 
at the major satellite DNA in TKO cells and in in vitro enzymatic assays 
(Extended Data Fig. 8g, h), suggesting a potential role for R836W in 
the redistribution of CpG versus non-CpG methylations in diseased 
cells. Given a largely heterozygous feature of DNMT3A mutations 
in leukaemia, we also queried whether the DNA-binding-defective 
mutants of DNMT3A inhibit functionality of DNMT3A". To test 
this, we turned to a co-expression system used previously for studying 
the dominant-negative DNMT3A8887H mutant?>, and reconstituted 
wild-type and mutant DNMT3A in equal amounts into TKO ES cells 
(Fig. 4d). Relative to expression of wild type alone, co-expression of 
DNMT3A¥71°?, DNMT3A°H, or DNMT3A**41¥ with DNMT3A"* 
significantly decreased overall cytosine methylation (Fig. 4d). Together, 
we show that the DNMT3A mutants defective in substrate binding 
not only decrease activity but also interfere with that of DNMT3A™". 

We further ectopically expressed the above DNMT3A mutants 
in TF-1 cells, a model used for studying leukaemia-associated gene 
mutation*®. Through DNA methylation array profiling and bisulfite 
sequencing validation, we observed a significant reduction in over- 
all CpG methylations in TF-1 cells stably expressing DNMT3A‘7!©, 
DNMT3A"7974 or DNMT3AS*4!. relative to control; by contrast, 
ectopic expression of DNMT3A™" induced hypermethylation (Fig. 4e 
and Extended Data Fig. 9). There was significant overlap among CpG 
sites showing hypomethylation due to expression of DNMT3A¥7!©, 
DNMT3A®™4 or DNMT3A‘*4E (Extended Data Fig. 10a), indicating 
their common effect on epigenomic deregulation. Reduced methylation 
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Figure 4 | Haematological cancer-associated mutations of the 
DNMT3A-DNA interaction residues. a, b, Surface (a) and stick (b) 
views of the DNA-contacting residues in DNMT3A found mutated 

in haematological cancer. Mutation sites are coloured blue in a. The 
hydrogen-bonding interactions and water molecule are shown as dashed 
lines and a purple sphere, respectively. c, In vitro methylation of CpG 
DNA using DNMT3A™" or its haematological cancer-associated mutants 
(n=3). d, DNMT3A immunoblots (top) and liquid chromatography- 
tandem mass spectrometry (LC-MS/MS and MS/MS/MS)-based 
quantification of 5-methyl-2'-deoxycytidine (5-mdC; bottom, n=3 
biological replicates) in DNA isolated from TKO ES cells re-expressing 


of these commonly affected sites was also detected after transduction of 
other leukaemia-associated substrate-binding mutations of DNMT3A 
(P718L, T835M, R836W, and N838D), although it did not induce hypo- 
methylation globally (Extended Data Fig. 10b, c). Binding of wild-type 
or mutant DNMT3A was comparable at tested loci showing methylation 
changes (Extended Data Fig. 10d, e). Given that epigenetic deregula- 
tion promotes TF-1 cell transformation characterized by cytokine- 
independent growth”®, we queried whether the DNA-binding-defective 
mutation of DNMT3A causes a similar transformation of this model. 
We found that, under cytokine-supporting conditions, TF-1 cells 
expressing wild-type or mutant DNMT3A exhibited comparable pro- 
liferation (Extended Data Fig. 10f). By contrast, those expressing a 
DNA-binding-defective mutant, but not DNMT3AW had significant 
cytokine-independent growth capability (Fig. 4f and Extended Data 
Fig. 10g). Collectively, we demonstrate that the DNA-binding residues 
of DNMT3A are vital for establishing appropriate CpG methylation 
in haematological cells and that their somatic mutations detected in 
patients with leukaemia promote transformation. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Protein expression and purification. The gene fragments encoding residues 
628-912 of human DNMT3A (NCBI accession number NM_022552) and residues 
178-386 of human DNMT3L were inserted in tandem into a modified pRSFDuet-1 
vector (Novagen). The DNMT3A sequence was separated from the preceding 
Hisg-SUMO tag by a ubiquitin-like protease (ULP1) cleavage site. Expression and 
purification of the DNMT3A-DNMT3L complex followed a previously described 
protocol’”. In brief, the Hiss-SUMO-DNMT3A fusion protein and DNMT3L were 
co-expressed in Escherichia coli BL21 DE3 (RIL) cell strains and purified using 
a Ni**-NTA column. Subsequently, the Hiss~-SUMO tag was removed through 
ULP1-mediated cleavage, followed by ion exchange chromatography on a Heparin 
column. For enzymatic assays, the DNMT3A-DNMT3L complex was further puri- 
fied through size-exclusion chromatography on a HiLoad 16/600 Superdex 200 pg 
column (GE Healthcare), and concentrated to 0.1-0.3 mM in a buffer containing 
20 mM Tris-HCl (pH 8.0), 100 mM NaCl, 0.1% -mercaptoethanol, and 5% 
glycerol. To generate the covalent DNMT3A-DNMT3L-DNA complex, an 
11-mer single-stranded DNA that was in-house synthesized to contain 2’-deoxy 
zebularine”® (5/-CATGdZGCTCTC-3'; dZ, 2/-deoxyzebularine) was annealed with 
a 10-mer single-stranded DNA (5‘-AGAGCGCATG-3’) before reaction with the 
DNMT3A-DNMT3 L complex in the presence of 20 mM Tris-HCl (pH 7.5), 50 mM 
NaCl, 20% glycerol, and 40 mM DTT at room temperature. In addition, a 25-mer 
zebularine-containing DNA (5’-GCATGZGTTCTAAT TAGAACGCATG-3’; 
Z, zebularine) was self-annealed and used to form a second DNMT3A-DNMT3L- 
DNA complex or the DNMT3A83¢A_DNMT3L-DNA complex. The reaction 
products were further purified through a HiTrap Q XL column (GE Healthcare), 
followed by size-exclusion chromatography on a HiLoad 16/600 Superdex 200 
pg column. The final samples for crystallization of the productive DNMT3A- 
DNMT3L-DNA complexes contained about 0.1-0.2 mM covalent DNMT3A- 
DNMT3L-DNA complexes, 0.3 mM AdoHcy, 20 mM Tris-HCl (pH 8.0), 100 mM 
NaCl, 0.1% 8-mercaptoethanol, and 5% glycerol. 

Crystallization conditions and structure determination. The crystals for 
the covalent complex of DNMT3A-DNMT3L with the 10/11-mer DNA were 
generated by the hanging-drop vapour-diffusion method at 23°C, from drops 
mixed from 0.511 of DNMT3A-DNMT3L-DNA solution and 0.5 11 of precipitant 
solution (7% PEG4000, 0.1 M Tris-HCl (pH 8.5), 100 mM MgCh, 166 mM imida- 
zole (pH 7.0)). The reproducibility and quality of crystals were further improved 
by the micro-seeding method. The crystals were soaked in cryoprotectant made 
of mother liquor supplemented with 30% PEG400, before being flash frozen in 
liquid nitrogen. For the complex of DNMT3A (either wild-type or R836A mutant) 
with DNMT3L and the 25-mer DNA, crystals were generated by the hanging-drop 
vapour-diffusion method at 4°C, from drops mixed from 1.511 of the protein 
solution and 1.5 11 of precipitation solution (0.1 M Tris-HCl (pH 7.0), 200 mM 
NaH 2POq, and 5% PEG4000). The crystals were treated with cryoprotectant 
containing the precipitation solution and 30% glycerol before collecting. 

X-ray diffraction datasets for the covalent DNMT3A-DNMT3L-DNA 
complexes were collected at selenium peak wavelength on the BL501 or BL502 
beamlines at the Advanced Light Source, Lawrence Berkeley National Laboratory, 
and the dataset for the covalent DNMT3A (R836A)-DNMT3L-DNA complex 
was collected on the 24-ID-E NE-CAT beamline at the Advanced Photon Source, 
Argonne National Laboratory. The diffraction data were indexed, integrated, and 
scaled using the HKL 2000 program” or the XDS program”. The structures of the 
productive covalent complexes of DNMT3A-~DNMT3L-DNA were solved using 
the molecular replacement method in PHASER*,, with the DNA-free structure of 
DNMT3A-DNMT3L (PDB 2QRV) serving as a search model. Further modelling 
of the covalent DNMT3A-DNMT3L-DNA complexes was performed using 
COOT* and then subject to refinement using the PHENIX software package*’. 
The same R-free test set was used throughout the refinement. The final models 
for DNMT3A-DNMT3L complexed with the 25-mer and 10/11-mer DNAs 
were refined to 2.65 A and 3.1 A resolution, respectively. The final model for 
DNMT3A (R836A)-DNMT3L complexed with the 25-mer DNA was refined to 
3.0 A resolution. 

The statistics for data collection and structural refinement of the productive 
covalent DNMT3A-DNMT3L-DNA complexes are summarized in Extended 
Data Fig. Ic. 

In vitro DNA methylation assay. Synthesized (GAC) 12, (AAC),2 and (TAC) 12 
DNA duplexes were used as CG-, CA- and CT-containing substrates, respectively. 
The DNA methylation assays were performed in triplicate at 37°C for 1h, 
unless otherwise indicated. In brief, a 20-1l reaction mixture contained 2.5 1M 
S-adenosyl-t-[methyl-°H] methionine (AdoMet) (specific activity 18 Cimmol”}, 
PerkinElmer), 0.3 .M DNMT3A-DNMT3L, 0.75 1M DNA in 59 mM Tris-HCl, pH 
8.0, 0.05% 3-mercaptoethanol, 5% glycerol, and 200 1g ml! BSA. The methylation 
reactions were stopped by flash freezing in liquid nitrogen, followed by precipita- 
tion and incubation on ice for 1h, in 1 ml of 15% trichloroacetic acid solution plus 


40\.g ml“! BSA. The trichloroacetic acid-precipitated samples were then passed 
through a GF/C filter (GE Healthcare) using a vacuum-filtration apparatus. After 
sequential washing with 2 x 5 ml of cold 10% trichloroacetic acid and 5 ml of 
ethanol, the filters were dried and transferred to scintillation vials filled with 5 ml 
of ScintiVerse (Fisher), followed by measurement of tritium scintillation with a 
Beckman LS6500 counter. 

Plasmid construction. Full-length human DNMT3A isoform 1 was cloned into 
the EcoRI site of the pPyCAGIZ vector (a gift from J. Wang). DNMT3A mutation 
was generated by a QuikChange II XL Site-Directed Mutagenesis Kit (Agilent). 
To achieve co-expression of the wild-type and mutant DNMT3A at equal levels in 
cells, we engineered a T2A-based fusion construct consisting of the mutant cDNA, 
which was added with an N-terminal 3 x Flag-(GGGGS)3-Myc tag to differentiate 
its protein size from non-tagged wild-type DNMT3A, followed by a T2A peptide 
sequence at its C terminus and the cDNA of non-tagged wild-type DNMT3A. 
Myc-tagged full-length homan DNMT3<A isoform 1 was cloned into MSCV Pac 
retroviral vector as previously described". All plasmid sequences were verified 
by sequencing. 

Cell lines and cell culture. Dumt3a, Dnmt3b, and Dnmt1 TKO mouse ES cells 
(a gift from M. Okano)'® were cultivated on gelatin-coated dishes in the high- 
glucose DMEM base medium (Invitrogen) supplemented with 15% of fetal bovine 
serum (FBS, Invitrogen), 1 x non-essential amino acids (Invitrogen), 0.1 mM 
8-mercaptoethanol, and 1,000 U ml“! leukaemia inhibitory factor (ESGRO). The 
TF-1 human erythroleukaemic cell line was obtained from American Type Culture 
Collection (ATCC) and cultivated in the RPMI 1640 base medium (Invitrogen) 
supplemented with 10% of FBS and 2 ng ml“! of recombinant human granulocyte- 
macrophage colony-stimulating factor (GM-CSF; R&D Systems). Acquisition of 
the cytokine-independent growth of TF-1 cells by the introduction of cancer- 
associated gene mutation was examined and quantified upon GM-CSF removal 
as previously described”®. 

Authentication of cell line identities, including those of parental and derived 
lines, was ensured by the Tissue Culture Facility affiliated to the University of 
North Carolina at Chapel Hill Lineberger Comprehensive Cancer Center using 
the genetic signature profiling and fingerprinting analysis previously described™. 
Every 1-2 months, a routine examination of cell lines in culture for any possible 
mycoplasma contamination was performed using commercially available detection 
kits (Lonza Walkersville). 

Generation of stable cell lines. TKO ES cells were transfected by Lipofectamine 
2000 (Invitrogen) with the pPyCAGIZ empty vector or that carrying wild-type 
or mutant DNMT3A. Forty-eight hours after transfection, the transduced ES 
cells were selected out in 50j1g ml! Zeocin (Invitrogen) for 10 days. The pooled 
stable-expression cell lines and independent single-cell-derived clonal lines were 
continuously maintained in the medium with 251g ml! zeocin. To generate TF-1 
leukaemia cell lines with stable expression of wild-type or mutant DNMT3A, the 
MSCV-based retrovirus was packaged in HEK293 cells and used for infection as 
previously described**. Forty-eight hours after infection, TF-1 cells were selected 
by 2,.g ml“! puromycin for 4 days and maintained in medium with 11g ml! 
puromycin. 

Western blotting. Antibodies used for western blotting were anti- MYC (Sigma- 
Aldrich, 9E10), anti-DNMT3A (Santa Cruz, H-295), anti-3-actin (Santa Cruz, 
sc-47778), and «-tubulin (Sigma-Aldrich). Total protein samples were prepared by 
cell lysis with SDS-containing Laemmli sample buffer followed by brief sonication. 
Extracted samples equivalent to 100,000 cells were loaded to the SDS-PAGE gels 
for western blot analysis. 

Quantification of 5-methyl-2’-deoxycytidine and 2’-deoxyguanosine in 
genomic DNA. The measurement procedures for 5-methyl-2'-deoxycytidine 
(5-mdC) and 2’-deoxyguanosine in genomic DNA were described previously**”. 
In brief, 1 xg of genomic DNA prepared from cells was enzymatically digested 
into nucleoside mixtures. Enzymes in the digestion mixture were removed by 
chloroform extraction, and the resulting aqueous layer was concentrated to 10,11 
and subjected directly to LC-MS/MS and LC-MS/MS/MS analysis for quantifica- 
tion of 5-mdC and 2'-deoxyguanosine, respectively. The amounts of 5-mdC and 
2'-deoxyguanosine (in moles) in the nucleoside mixtures were calculated from area 
ratios of peaks found in selected-ion chromatograms for the analytes over their 
corresponding isotope-labelled standards, the amounts of the labelled standards 
added (in moles), and the calibration curves. The final levels of 5-mdC, in terms 
of percentages of 2'-deoxyguanosine, were calculated by comparing the amounts 
of 5-mdC relative to those of 2’-deoxyguanosine. 

eRRBS and data analysis. Genomic DNA of each sample was added with 0.5% 
of unmethylated lambda DNA (Promega) as a spike-in control and subjected to 
eRRBS using MethylMidi-seq (Zymo Research) as described before”. In brief, 
approximately 300 ng of DNA were digested with three restriction enzymes 
(80 units of MspI, 40 units of Bfal, and 40 units of Msel) to improve genomic 
DNA fragmentation and coverage. The generated DNA fragments were ligated 
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to the pre-annealed 5-methyl-cytosine-containing adapters, followed by filling 
in overhangs and the A extension at the 3’ terminus. The DNA fragments were 
then purified and subjected to bisulfite treatment using an EZ DNA Methylation- 
Lightning Kit (Zymo Research). After amplification, the quality of eRRBS libraries 
was checked with Agilent 2200 TapeStation, followed by deep sequencing using 
an Illumina HiSeq-2000 genome analyser (50 base pairs and paired end as 
parameters). Obtained reads were aligned to in silico bisulfite-converted mouse 
reference genome mm9 and lambda DNA sequence (GenBank accession number 
J02459.1) using the Bismark package in a strand-specific manner**. For identi- 
fication of methylated cytosines, all mapped cytosines were subjected to a bino- 
mial distribution model-based methylation calling as described in the scetion 
below. To determine distribution of methylation levels, only those high-quality 
reads with at least 15 times coverage were used. For convenience of data anal- 
ysis and to increase data complexity, data from all three biological replicates 
were merged and cytosine sites covered with at least 15 reads in the merged 
dataset were used for downstream analysis such as averaged methylation levels 
in 10-kb window sliding and aggregated methylation levels across genes. Data 
representation and plots were generated with the ‘ggplot2’ package in R software 
using custom scripts. 

Identification of methylated cytosines. We used a previously described binomial 
model to identify methylated cytosines!”!*. Specifically, with the unmethylated 
spike-in lambda DNA, we first determined the bisulfite non-conversion rate 
(probability, P) for each cytosine sequence context independently (that is, 
CpG, CpA, CpC, and CpT). For each mapped cytosine in our eRRBS data, we 
calculated the binomial P value in which methylated reads occurred out of 
the total read number on the basis of the binomial test, with the bisulfite non- 
conversion rate as the success probability (P). If a P value was under a threshold, 
we defined the cytosine as truly methylated. To determine the FDR for each 
different threshold, we created a control methylome for each eRRBS sample. In 
the control methylome, read depth at each cytosine was equal to the real data, 
and the methylated events were simulated by binomial distribution using the 
previously defined non-conversion rate (P). The FDR was determined by the 
ratio between the number of identified methylated cytosine sites from the con- 
trol methylome and that from the real data. For each eRRBS sample, we chose 
to use a P value under which the FDR was less than 1% or 0.1%, as specified in 
figure legends. 

DNA methylation array and data analysis. Genomic DNA was extracted and 
bisulfite-converted as described above. DNA methylation profiling using an 
Illumina Infinitum HumanMethylation450 BeadChip array was performed by the 
UNC Genomics Core according to the manufacturer's instructions. Methylation 
data were then subject to background subtraction and control normalization by 
executing preprocessIllumina in the R ‘minfi’ package*’. Differentially methylated 
CpGs were identified using dmpFinder in a categorical mode. Methylation changes 
were considered significant at a q-value of less than 0.05 and a value difference 
of more than 0.1. Hierarchical clustering analysis, scatter plots, and density plots 
were generated in R using ‘pheatmap’ and ‘ggplot2’ packages. 

Sanger bisulfite sequencing. Sanger bisulfite sequencing was performed as previ- 
ously described“. In brief, genomic DNA was prepared using a DNeasy Blood and 
Tissue Kit (Qiagen) and 1 jug genomic DNA subject to bisulfite conversion using 
an EZ DNA Methylation Gold Kit according to manufacturer’s instructions (Zymo 
Research). Bisulfite-treated DNA was then used as template in PCR to amplify 
the target DNA region, followed by cloning of PCR products into pCR2.1-TOPO 
vector (Invitrogen) for direct sequencing of individual clones. Four biological 
replicates per cell line were tested, with at least ten clonal sequences per replicate 
generated. The primers used for amplifying a major satellite DNA sequence located 
at chromosome 2 were 5/-GGG AAT TTT GGT GGT AGG GT-3’ and 5’-AAA 
AAA CAT CCA CTT AAC TAC TTA AAA A-3’. The primers used for validating 
450k array data were, for EIF4G1, 5/-AGG AGA TTG AGG TTT TAG TGA ATA 
TGT-3/ and 5’-CCC TAT ATC AAA TTC TTC CTA CCA TAA-3’; for HDLBP, 
5!-GGA GGT GAA GTT ATG GAG ATA TTT TT-3’ and 5’-ATC CCA TAC CAA 
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CAA AAA CTA ACA A-3’; for FOXK2, 5'-TAT GTT TGT ATT TGG GGT GTT 
TTT T-3’ and 5’-CTA AAA AAT CAA AAA CAT TTC CTA CC-3’. 

Chromatin immunoprecipitation. Samples used for chromatin immunoprecipi- 
tation (ChIP) were prepared as previously described’. In brief, chromatin samples 
extracted from cells expressing Myc-tagged DNMT3A were used for ChIP with the 
9E10 anti-Myc antibody (Sigma-Aldrich), with cells expressing empty vector used 
as negative control. Real-time PCR was performed for detecting DNMT3A binding 
at sites listed below. The primers for ChIP-PCR at each tested site were as follows: 
for cg23189692, 5’-TTG GCA TGC TCA CAG AGA GG-3/ and 5'-GTC TTC CCA 
GGC TCA TTG CT-3’; for cg00704780. 5‘-AGC AAA ACG GTC AGT AGC CA-3/ 
and 5/-TAC CAG CAA AAG CTG GCA GG-3’; for cg10460657, 5’-GCC TCT 
GAC CTG CTG TCT AC-3’ and 5’-AGG AAA TGC CCC AGA CGT G-3; for 
cg07564962, 5’-GGC CGG CAC TAA TGT CTT TC-3’ and 5’-TTC CCT GCT 
CTG TGG GAA GG-3’; for cg13393476, 5’-CCT TGC GAG TGA GTC ACG G-3/ 
and 5/-GAG ATT CTG CCA GGC TCC AC-3’; for cg20509869, 5’-GTG GGA 
CGC TAA CCC TCT TC-3’ and 5'‘-GGC GGC TGA TTT ATC TGG GT-3’; and 
for GAPDH transcription start site, 5/-TCT CCC CAC ACA CAT GCA CTT-3’ 
and 5’-CCT AGT CCC AGG GCT TTG ATT-3’. 

Statistics. No statistical methods were used to predetermine sample size. The 
experiments were not randomized. The investigators were not blinded to allocation 
during experiments and outcome assessment. Data are presented as the mean +s.d. 
of at least three independent experiments. Statistical analysis was performed with 
a Student's t-test for comparing two sets of data with assumed normal distribution. 
A P value of less than 0.05 was considered to be significant. 

Code availability. The scripts for genomic data analyses and all other data are 
available from the corresponding authors upon reasonable request. 

Data availability. Coordinates and structure factors for DNMT3A-DNMT3L in 
complex with 25-mer DNA, DNMT3A-DNMT3L in complex with 10/11-mer 
DNA, and DNMT3A (R836A)-DNMT3L in complex with 25-mer DNA have 
been deposited in the Protein Data Bank under accession numbers 5YX2, 6F57, 
and 6BRR, respectively. The eRRBS and Illumina Human Methylation 450K array 
data have been deposited in NCBI Gene Expression Omnibus under accession 
number GSE99391. 
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d Mi Free DNMT3A-DNMT3L 


mm DNMT3A-DNMT3BL bound to 10/11-mer DNA 
DNMT3A-DNMT3BL bound to 25-mer DNA 


Extended Data Figure 1 | Structures of the DNMT3A-DNMT3L 
tetramer in complex with the 10/11-mer DNA. a, The sequence of the 
10/11-mer DNA duplex used for structural study. b, Chemical formula 
of the covalent adduct of DNMT3A and 2/-deoxyzebularine. c, Data 
collection and refinement statistics. Each dataset was collected from a 
single crystal. d, Ribbon representations of the DNMT3A-DNMT3L 
tetramer in complex with the 10/11-mer DNA duplex and AdoHcy. 
DNMT3A, DNMT3L, and DNA are coloured in light blue, green, and 
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wheat, respectively, and AdoHcy shown in sphere representation. The 
boxed areas show expanded views for the CpG sites (purple and yellow), 
the DNA-binding TRD and catalytic loops (left box), and the flipped out 
2'-deoxyzebularine (dZ6’) surrounded by conserved catalytic residues 
(right box). The F, — F, omit map of 2’-deoxyzebularine (pink) is 
contoured at the 3a level. The hydrogen-bonding interactions are depicted 
as dashed lines. 
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Extended Data Figure 2 | Intermolecular interactions between the DNMT3L (shown in green) are coloured in pink (a4 and a5 in accordance 
DNMT3A-DNMT3L tetramer and DNA. a, Structural overlay of free with the numeration in Extended Data Fig. 3g) and preceded by two 
DNMT3A-DNMT3L (PDB 2QRV) with the 10/11-mer DNA and 25-mer DNA contact loops, coloured in blue. The flipped out zebularine (Z5) 
DNA-bound states. b, Stick representation of the 25-mer DNA duplex is coloured in purple. The bound AdoHcy molecule is shown in sphere 


bound to the DNMT3A-DNMT3L tetramer, with the 2F, — F, omit map representation. 
contoured at the 1a level. c, The two helices of DNMT3A that interact with 
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Extended Data Figure 3 | Various intermolecular interactions 
between the two DNMT3A monomers and DNA. a-f, DNA binding 

by the first and second DNMT3A monomer (defined as 3A and 3A’, 
respectively, in Fig. 1c) includes the intermolecular interactions between 
the TRD loop of DNMT3A and the DNA major groove (a, d), between 
the catalytic loop of DNMT3A and the DNA minor groove (b, e), and 
between the homodimeric interface of DNMT3A and the DNA backbone 
(c, f). The hydrogen-bonding interactions are shown as dashed lines. 
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The water molecules are shown as purple spheres. g, Structure-based 
sequence alignment of DNMT3 proteins from human (hDNMT3A and 
hDNMT3B), mouse (mDnmt3a and mDnmt3b), and zebrafish (zDnmt3a 
and zDnmt3b). Completely conserved residues are coloured in white 

and highlighted in red. Partly conserved residues are coloured in red. 
Secondary structures are shown above the aligned sequences. The DNA- 
binding residues as revealed by this study are marked with black triangles. 
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Extended Data Figure 4 | See next page for caption. 
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Extended Data Figure 4 | The essential roles for the CpG-engaging 
residues of DNMT3A, R836, and V716 in DNMT3A-mediated 

CpG versus non-CpG methylations. a, b, DNA methylation 

kinetics analysis of wild-type (DNMT3A 5, a) and R836A-mutated 
(DNMT3A®*4, b) DNMT3A using the CpG-, CpA-, or CpT-containing 
DNA substrates (n = 3 biological replicates). Purified DNMT3A- 
DNMT3L tetramer complex was used for measurements, followed by 
fitting with a first-order exponential equation. c, Ribbon representation 
of the crystal structure of DNMT3A**°A_DNMT3L tetramer in complex 
with the 25-mer DNA, with the CpG recognition by one of the DNMT3A 
monomers shown in expanded view. The 2F, — F- omit map of DNA was 


contoured at the 0.80 level, and coloured in light blue. d, Methylation 
assay using either DNMT3A" or DNMT3A‘! on CG-, CA-, and 
CT-containing DNA (n= 3 biological replicates). e, Immunoblots 

detect reconstituted expression of the indicated DNMT3A among TKO 
mouse ES cells, either the pooled stable-expression cell population or 
single-cell-derived clonal lines. EV, empty vector. A representative blot 

of two independent experiments is shown. For gel Source Data, see 
Supplementary Fig. 1. f, LC-MS analysis reveals the global 5-methyl-2’- 
deoxycytidine (5-mdC) levels (calculated as 5-mdC/2’-deoxyguanosine on 
the y axis) in the TKO ES cells after stable transduction of empty vector or 
the indicated DNMT3A (n=3 biological replicates). Data are mean + s.d. 
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Extended Data Figure 5 | eRRBS reveals distribution of cytosine 
methylations in each sequence context among TKO ES cells with 
reconstituted expression of either DNMT3A”! or DNMT3A®*, 

a, The rates of bisulfite conversion for the indicated sequence context in 
each sample as determined by the unmethylated lambda DNA spike-in 
control. CpN, all cytosines. b, ¢, Pie charts showing the percentage of 
methylated cytosines (total number n shown at the bottom of each plot) 
identified among the DNMT3A7- or DNMT3A®*°4 expressing TKO ES 


cells in each sequence context. The methylated cytosines were called using 
a stringent binomial distribution-based filter to eliminate false positives 
from incomplete bisulfite conversion, with an FDR of 1% and 0.1% set 

for b and ¢, respectively. d, Distribution of methylation levels (percentage 
on x axis) for the indicated sequence context. Inserts show a closed view 
of the distribution at sites with intermediate to high levels of cytosine 
methylation. 
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Extended Data Figure 6 | See next page for caption. 
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Extended Data Figure 6 | CpG and non-CpG methylations induced 
by DNMT3A7 versus DNMT3A®*°* in the TKO ES cells. a, Overall 
methylation levels of cytosines at four different sequence contexts 

as detected by eRRBS. On the y axis, the averaged methylation level 

of all cytosines within the mouse genome is shown, as calculated by 
normalization of the detected methylation cytosines over total cytosine 
numbers in the TKO ES cells reconstituted with either DNMT3A? (left) 
or DNMT3A%*** (right). b, Global levels of CpG and CpH (H=A, C, 
or T) methylation induced by DNMT3A7 (green) or DNMT3A®84 
(red) across the mouse chromosomes of the TKO ES cells. Box and 
whisker plots of 10-kb-bin-averaged methylation levels of each mouse 
chromosome are shown. c, Global levels of CpG and CpH methylation 
induced by DNMT3A" (green) versus DNMT3A®* (red) across all 
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annotated genes. Each gene was divided into 100 equally sized bins and 
the 10-kb flanking region was divided into 50 equally sized bins. Averaged 
methylation levels were plotted for each bin. TSS, transcription start site; 
TES, transcription end site. d, Global levels of CpG and CpH methylation 
induced by DNMT3A" (green) versus DNMT3A®*°4 (red) on the two 
opposite DNA strands. Boxplots of 10-kb-bin-averaged CpG, CpA, CpC, 
and CpT methylation levels of each strand are shown. e, Representative 
gene-wide views of CpG and CpH methylations at Foxp1 and Dock1, 
which are grouped into either the forward (+) or reverse (—) DNA strand. 
Cytosines covered by at least 15 reads from eRRBS data are shown, with 
each site designated by a vertical line. Panels a-e use the combined dataset 
of three biological replicates per group. Box plots depict the interquartile 
range, and whiskers depict 1.5 x interquartile range. 
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Extended Data Figure 7 | Sanger bisulfite sequencing to validate the 
cytosine methylation levels mediated by DNMT3A, either wild type or 
defective in recognizing the CpG substrate, among the TKO mouse ES 
cells. a, Sequence of the examined major satellite DNA region. Primers 
used for bisulfite PCR are denoted with 5’ and 3’ primer pairing. The 
counts for cytosines, highlighted in colour, are 9 for the CG dinucleotide, 
33 for CA, 14 for CT, and 13 for CC. b, A representative result for bisulfite 
sequencing analysis of the major satellite repeat region described above 
in the TKO cells expressing empty vector, wild-type DNMT3A, or the 
indicated mutant. Each row represents one DNA clone and each column 
represents one site of cytosine, either methylated (filled) or unmethylated 
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(open). c, d, Percentage of methylation mediated by DNMT3A or the 
indicated mutant at CpG (c) and non-CG (d) sites within the examined 
major satellite DNA region in the TKO ES cells. Data are mean +s.d.; n=4 
independent bisulfite sequencing experiments as shown in b. e-h, Average 
cytosine methylation levels at each individual site grouped by the CpG (e), 
CpA (f), CpT (g), or CpC (h) context in the examined major satellite 

DNA among the TKO ES cells reconstituted with DNMT3A versus 
DNMT3A®* (7 = 4 biological replicates; mean + s.d., with the labelled 
P values). Statistical analysis used a two-tailed Student’s t-test; NS, 

not significant. 
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Extended Data Figure 8 | See next page for caption. 
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Extended Data Figure 8 | Effect of haematological cancer-associated 
mutations of DNMT3A on DNA methylation in vitro and in mouse 
TKO ES cells. a, Methylation kinetics of DNMT3A with mutations located 
at the catalytic loop, compared with DNMT3A= b, Methylation kinetics 
of DNMT3A with an active site mutation, R792H. c, Methylation kinetics 
of DNMT3A with mutations located at the TRD loop. d, Methylation 
kinetics of DNMT3A with the hotspot mutation R882H. For a-d, 
DNMT3A-DNMT3L complex was used for the measurements (n = 3 
biological replicates), followed by fitting with a first-order exponential 
equation. These data were measured independently from those shown in 
Fig. 4c. e, Immunoblot detects stable reconstitution of DNMT3A™! or 
the indicated DNMT3A mutant in the TKO ES cells. A representative 
blot of two independent experiments is shown. For gel Source 

Data, see Supplementary Fig. 1. f, LC-MS analysis reveals the global 


5-methyl-2’- deoxycytidine (5-mdC) levels (calculated as 5-mdC/ 
2'-deoxyguanosine) in the TKO ES cells after stable transduction of 
empty vector or the indicated DNMT3A. The methylation levels relative 
to TKO cells expressing DNMT3A? are plotted. Data are mean +s.d.; 
n=4 biological replicates. g, Individual bisulfate sequencing detects 

the methylation level of cytosines in each sequence context within a 
major satellite DNA site at chromosome 2 in the TKO cells reconstituted 
with DNMT3A®*°6W (right), compared with DNMT3A! (left) or 
DNMT3A** (middle; as determined in Fig. 3g) (n= 4; mean +s.d.). 
Statistical analysis used a two-tailed Student's t-test. h, In vitro methylation 
of CG-, CA-, or CT-containing DNA using DNMT3A! (left) or 
DNMT3A®*W (right) in complex with DNMT3L, reacted for 40 min 

(n = 3 biological replicates). The methylation levels relative to 
CG-containing DNA substrates are plotted. Data are mean + s.d. 
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Extended Data Figure 9 | Effect of haematological cancer-associated 
mutations of DNMT3A on genomic DNA methylation in the TF-1 
leukaemia cells. a, Immunoblot of the TF-1 cells stably transduced 

with Myc-tagged DNMT3A, either wild type or the indicated cancer- 
associated mutants. EV, empty MSCV vector. A representative blot 

of two independent experiments is shown. For gel Source Data, see 
Supplementary Fig. 1. b, Profiling of the indicated DNMT3A-expressing 
TF-1 cell lines with the HumanMethylation_450K BeadChip array reveals 
the mean methylation ( values for all examined CpGs. Each dot represents 
a biological replicate: that is, an independently derived stable-expression 
cell line (n = 3-8 biological replicates per group; mean + s.d.). Statistical 
analysis used a two-tailed Student's t-test. c, Density plot of methylation 
@ values for all examined CpGs in the indicated DNMT3A-expressing 
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TF-1 cell lines. The inserted box shows a zoom-in view for densities for 
highly methylated DNA sites among the indicated samples. Data are 
mean +s.d., with the labelled P values. Statistical analysis used two-tailed 
Student's t-test. d, Sanger bisulfite sequencing of the indicated regions 
from TF-1 cell lines stably transduced with empty vector, DNMT3A™! or 
the indicated cancer-associated mutant. Individual CpG sites (circles) are 
filled with black (methylated) or white (unmethylated). Red circles denote 
the CpG sites covered by the Illumina Infinium 450K DNA methylation 
array. Data are mean + s.d.; n = 3 biological replicates. e, Methylation 
values of the indicated CpG sites (labelled by red circles in d) based on the 
measurements with the Infinitum 450K DNA methylation arrays (n = 3-8 
biological replicates; mean + s.d.). 
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Extended Data Figure 10 | See next page for caption. 
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Extended Data Figure 10 | Effect of haematological cancer-associated 
DNMT3A mutations on DNA hypomethylation and cytokine- 
independent growth of the TF-1 leukaemia cells. a, Venn diagram 

of CpG sites with hypomethylation induced by either one of the three 
indicated strong DNA-binding-defective mutants of DNMT3A, V716D, 
K841E, and R792H. b, c, Bar plots (b) and scatter plots (c) showing 
methylation difference at the 605 commonly hypomethylated CpG sites 
identified in a among the TF-1 cells with stable transduction of either 
DNMT3A' or the indicated mutant, compared with empty MSCV 
vector. A black line in b indicates empty vector control (n = 3-8 
biological replicates per group; mean + s.d.). In c, mean methylation 

@ values are plotted of each individual CpG in the indicated DNMT3A 
experimental group (x axis) and control empty vector group (y axis; 

n= 3-8 biological replicates per group). d, e, Comparable occupancy 
of DNMT3A and its mutant forms at the indicated genomic loci 

with affected DNA methylation, as measured by ChIP analysis of the 
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Myc-tagged DNMT3A" or mutants in TF-1 stable-expression cell lines. 
Tested sites by ChIP-qPCR in d (n =3 biological replicates; mean + s.d.) 
included three CpG sites showing hypomethylation due to expression 

of mutant DNMT3A (left; also see Extended Data Fig. 9e), three sites 
showing hypermethylation due to expression of DNMT3A (middle; 
also see e, which shows mean methylation ( values from measurements 
with the Infinium 450K DNA methylation arrays, n = 3-8 biological 
replicates; mean + s.d.), and a negative control locus (right; the GAPDH 
transcription start site). The anti-Myc antibody was used for ChIP and 
the empty-vector-expressing TF-1 cells used as cell control for unspecific 
binding. f, Proliferation of the indicated DNMT3A stable-expression TF-1 
cells in the presence of a supporting cytokine, GM-CSF (n = 2 biological 
replicates; mean + s.d.). g, Proliferation of the indicated DNMT3A- 
expressing TF-1 cells after GM-CSF withdrawal (n= 3 biological 
replicates; mean + s.d.). 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature25163 


Corrigendum: AMPA receptor- 
mediated regulation of a 
G;-protein in cortical neurons 


Yizheng Wang, Daniel L. Small, Danica B. Stanimirovic, 
Paul Morley & Jon P. Durkin 


Nature 389, 502-504 (1997); doi:10.1038/39062 


It has come to our attention that Fig. 4 in this Letter presented three 
events of data duplication. In Fig. 4a the panels in row 1, columns 1 
and 7 were the same, in Fig. 4a the panels in row 3, columns 5 and 7 
were the same, and in Fig. 4c and e, the panels in columns 1-3 were the 
same. Given the time elapsed since publication, we could not locate the 
original raw data. The main conclusion illustrated by Fig. 4, however, 
of dose-dependent (Fig. 4d) and GB+-sensitive (Fig. 4f) activation of 
Gail by AMPA in MIN6 cells, which do not express GluR6 (Fig. 4b), 
remains unaffected. Authors J.P.D. and P.M., now both retired, could 
not be reached. The original Letter has not been corrected online. 
Correspondence should be addressed to Y.W. (yzwang@ion.ac.cn). 
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CORRECTIONS & AMENDMENTS 


ERRATUM 
doi:10.1038/nature25445 


Erratum: Runx3 programs CD8* 
T cell residency in non-lymphoid 
tissues and tumours 


J. Justin Milner, Clara Toma, Bingfei Yu, Kai Zhang, 

Kyla Omilusik, Anthony T. Phan, Dapeng Wang, 

Adam J. Getzler, Toan Nguyen, Shane Crotty, Wei Wang, 
Matthew E. Pipkin & Ananda W. Goldrath 


Nature 552, 253-257 (2017); doi:10.1038/nature24993 


In this Letter, owing to errors introduced during the proofreading 
process, the words ‘infection with were missing from the sentence 
“Furthermore, Runx3 RNAi also impaired Try cell differentiation in 
the context of a localized infection with enteric Listeria monocytogenes 
expressing GP33_4; (LM-GP33_41) (Fig. 2b) In addition, in Fig. 1a, the 
x-axis label for the bottom right graph should have read “Expression 
change log»(D7 kid/D7 Tc)” rather than “Expression change log,(D35 
kid/D35 Tem)”. In Fig. le, the arrow pointing from the spleen to Tom 
should have been enlarged and aligned with the arrow above, and in 
the heat map in Fig. 1f ‘Irf# should have been non-italic upright font. 
These errors have all been corrected in the online versions of the Letter. 
Supplementary Information to this Corrigendum shows the original 
uncorrected Fig. 1, for transparency. 


Supplementary Information is available in the online version of this 
Corrigendum. 
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CORRECTIONS & AMENDMENTS 


ERRATUM 
doi:10.1038/nature25471 


Erratum: Moving beyond 
microbiome-wide associations 
to causal microbe identification 


Neeraj K. Surana & Dennis L. Kasper 


Nature 552, 244-247 (2017); doi:10.1038/nature25019 


In this Letter, errors in Fig. 2c were inadvertently introduced during the 
production process. The key to the survival graph should state ‘HMb 
for the green line rather than ‘MMb, SHMbMMb—!& for the blue line 
rather than ‘MMb#™>—!® and SHMb™M™>—3 for the red line rather than 
‘MMb!™>-3* The original figure has been corrected online. 
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Marine ecologist Sandra Binning recommends that researchers who are pregnant take a few extra steps of preparation before going out into the field. 


WORK-LIFE BALANCE 


Child on the horizon 


Fieldwork can be tough ona pregnancy — but there are ways to manage. 


BY EMILY SOHN 


onservation biologist Amy Dickman 
had built her career around 
remote fieldwork in Namibia and 
Tanzania — extended trips that included close 
run-ins with wild animals. She had encoun- 
tered venomous cobras in toilets and bread 
bins. She had been charged by elephants. And 
she had been attacked by a cheetah. During 
her first expedition to Tanzania in 2004, a lion 
sniffed at her tent and eventually fell asleep on 
top of it, trapping her arm beneath its body. 
The likelihood of such events — both 


thrilling and terrifying — was high in her 
work, which aimed to reduce conflicts between 
humans and carnivores. So when Dickman 
learnt in 2013 that she was pregnant, it felt at 
first as if she might be risking a huge career 
setback. 

She worried that the pregnancy would 
interfere with her usual field trips to the 
African wilderness, where she often spent eight 
months at a stretch doing research that has 
helped to reduce killings of carnivores by 80% 
(see go.nature.com/2guyn 1h); it has also led to 
multiple career and funding awards and some 
40 publications in peer-reviewed journals. And 


she fretted that colleagues, superiors and oth- 
ers might perceive her as professionally weaker 
— although everyone, including her university, 
was supportive. 

Equally disquieting, she says, was the new 
equation she faced when making decisions 
about risks: she was now carrying a tiny person 
that could not offer its own consent to being 
plunged into dangerous situations. “You're 
directly responsible for someone else within 
your body,” says Dickman, who has appoint- 
ments at the University of Oxford, UK, and the 
University of Vermont in Burlington. And oth- 
ers, she notes, including the baby’s father, 


15 FEBRUARY 2018 | VOL 554 | NATURE | 393 
© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


> also havea vested interest in risk mitigation. 

As academic researchers worldwide struggle 
with a model that places increasing impor- 
tance on every publication, grant and keynote 
talk, many female researchers whose careers 
hinge on remote or challenging fieldwork ago- 
nize over the idea of getting pregnant. They 
wonder whether they can continue to travel 
into the field to dig for fossils or scuba dive for 
data, and whether they will need to change or 
curtail the type of research they do. 

Yet those who have continued with expedi- 
tions say they find that the experience can be 
safe and rewarding, and beneficial for science: 
their successful field trips show other female 
researchers, for example, that they, too, can be 
scientists and also plan a family. 

But health in pregnancy is unpredictable, 
as are conditions in the field. After months 
of preparing for trips that might involve large 
teams and complicated logistics, it can be hard 
to decide whether to go and, once there, to 
determine how best to deal with pregnancy- 
related complications or disheartening 
comments from superiors or others. 

Scientists who have balanced pregnancy 
with remote or otherwise risky fieldwork rec- 
ommend detailed planning for medical care 
abroad along with frank communication with 
superiors and colleagues about their specific 
fieldwork aims. A willingness to shift gears can 
be helpful: sometimes, cancelling an excur- 
sion can be the right choice. Comparing the 
length of a pregnancy with the timespan of a 
typical career can also provide a helpful sense 
of perspective. 

“Over the arc of a career, these are little blips. 
says Kim Cobb, a climate scientist at the Georgia 
Institute of Technology in Atlanta. “The idea 
that a field trip is make-or-break is kind of folly. 
You'll have ample opportunity over the course of 
your career to participate in field trips.” 


DELICATE BALANCE 

Some scientists try to make deliberate choices 
about when to start a family as a way to ease 
the transition into a new phase of life. As a 
graduate student at the Scripps Institution of 
Oceanography in La Jolla, California, Cobb 
loved having “two surfboards and a career that 
stretched out in front of me full of possibility, 
publications and adventure”. 

But she wasn't sure how a pregnancy might 
affect the feeling of freedom that she so rel- 
ished. Because she and her husband (also an 
academic researcher) were based in the United 
States, which has less-generous parental leave 
benefits than do many other countries, they 
decided to delay their first pregnancy until 
they each felt relatively secure in their jobs. 
They now have four children, and Cobb 
resumed trips into the field when each infant 
was nine or ten months old. 

Cobb feels lucky that she could control her 
family timing in a way that worked out well 
for her. Some female researchers, whether 


for personal or for other reasons, don’t want 
to wait — or don't have the option of wait- 
ing — until their careers are established before 
they become pregnant. And sometimes, of 
course, a pregnancy comes as a surprise. The 
uncertainties that surround a pregnancy can 
cause conflicts when an expedition requires 
detailed planning. As a result, questions about 
when and how to tell superiors can arise earlier 
than might be comfortable for some. 

Dickman, for example, felt obliged to tell her 
supervisor about her pregnancy when she was 
only seven weeks into it, much sooner than 
many women would share such news, because 
she would need to shift the timing of a field trip 
to avoid taking anti-malaria medications in her 
first trimester. “This is something we shouldnt 
feel apologetic about,’ she says. Her supervisors 
were fully supportive of her choices. 

It is often best to emphasize the need 
for confidentiality when informing others 
about a pregnancy, says Sian Halcrow, a bio- 
archaeologist at the University of Otago in 
New Zealand. Early in her first pregnancy in 
2004, she learnt that she was not legally obliged 
to tell her superiors until her second trimester. 
But she privately shared the news, which was 
leaked to her department head. 

When she opted to tell him herself, he said 
that he already knew. Although her career was 
not affected by this breach of confidence, she 
was still upset by it. 

Halcrow also recommends documenting 
communications. Her plans for a field trip to 
another country were cancelled once against her 
will, and she now advises pregnant researchers 
to put all discussions into writing, and to follow 
up face-to-face meetings with e-mails that 
summarize the conversations and underscore 
the person’s desire to stay involved in fieldwork. 
“Pregnancy is nota failure; she says. 


RISKS AND RECOMMENDATIONS 
Travelling while pregnant raises some health 
risks, especially when the destination is remote 
or endemic for specific diseases, such as Zika, 
which can cause birth defects. The World 
Health Organization in Geneva, Switzerland, 
recommends the second trimester as the safest 
for travel, advises against sleeping at altitudes 
above 3,000 metres during pregnancy and 
warns of serious complications if pregnant 
women contract malaria or viral hepatitis E. 
The risk of developing deep-vein thrombosis 
during flights is five to ten times higher than 
average, or about 1 in 1,000 to 1 in 500, for 
pregnant women, according to the US National 
Institutes of Health (M. Izadi et al. Adv. 
Biomed. Res. 4, 60; 2015). The organization 
recommends that women who travel during 
pregnancy pack a medical kit that includes pre- 
natal vitamins, compression hosiery, antiemetic 
drugs and a blood-pressure monitor. 
Physicians can offer important advice about 
forthcoming field trips, says Cobb. She opted 
out of a trip during her second pregnancy 
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Archaeologist Briana Pobiner in the field in Kenya. 


after asking her obstetrician about visiting 
a research site on an isolated Pacific Ocean 
island, from where it would have taken a week 
to reach medical care. The obstetrician vetoed 
the trip, Cobb says. Cancelling doesn't have 
to ruin a research project for ever, she adds. 
For example, she has quickly received ‘no- 
cost’ grant-funding extensions from the US 
National Science Foundation, allowing her to 
postpone fieldwork. 

Considering a project’s importance to 
big-picture goals can help to guide choices, 
says Sandra Binning, a marine ecologist at the 
University of Neuchatel in Switzerland. While 
pregnant, she travelled to a long-familiar 
remote field site in Australia near the Great 
Barrier Reef: the work there was essential to 
her long-term research and she knew that the 
local field crew was supportive. But she can- 
celled a mid-pregnancy trip to Egypt — where 
she was meant to co-teach a class on field 
research to graduate students — because 
she had contracted a food-borne illness on 
previous trips. A graduate student who was 
originally scheduled to observe the course for 
training replaced her, with no repercussions. 

Extra preparation can smooth fieldwork 
trips and mitigate anxiety. Before going to 
Australia, Binning asked her obstetrician to 
sign documents that said she was fit to travel, 
and told field-station directors about her 
pregnancy in case of complications. She also 
flew home when she was about seven months 
pregnant because some airlines can stop 


NICK WALTON 


women with advanced pregnancies from 
travelling on long-haul routes (see go.nature. 
com/2fxptyv). And she called her university, 
her health-insurance company and each 
airline that she would be flying with to find 
out whether she would need a physician’s 
approval to fly home at the end of the trip. 

Those who have been pregnant while 
doing fieldwork recommend researching 
vetted medical providers and learning about 
the safety profile of medications that might 
be necessary in the field. Friends, colleagues 
and others can answer questions and pro- 
vide emotional support. Before a two-month 
trip to Kenya during her pregnancy, Briana 
Pobiner, an archaeologist at the Smithsonian 
Institution in Washington DC, got in touch 
with colleagues and others who she knew 
had done similar travel. She also contacted 
the East African chapter of her undergradu- 
ate institution’s alumni association to learn 
about the challenges they faced and how they 
managed them. Those conversations helped 
her to locate a hospital close to the field site, 
as well as an obstetrician in nearby Nairobi 
whom she could see for check-ups. 

Such advice from a wider community can 
bea great help, agrees Suzanne Pilaar Birch, 
an archaeologist at the University of Georgia 
in Athens. Soon after she became pregnant 
in 2016, she was 
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was unsure whether 
to go. What if she 
committed to the 
trip but suddenly faced complications? And 
would she have to tell the team that invited 
her before she agreed? 

To seek advice, she started a conversation 
on Twitter through TrowelBlazers, a network 
of female archaeologists and Earth scientists 
that she had helped to create several years ear- 
lier, under the hashtag #pregnantinthefield. 
The torrent of responses from scientists 
in many disciplines included some tales 
of pregnancy complications that had kept 
researchers home. But many more of the 
anecdotes were positive, and women posted 
photographs of themselves doing such tasks 
as measuring iguanas or excavating fossils in 
countries as varied as Belize, Oman, Croatia 
and Chile. Ultimately, Birch accepted the 
invitation to Cyprus, told her hosts about the 
pregnancy in the initial phone call and went 
ona smooth trip. She was happy to get in one 
last drive for data before the baby came. 

Seeing other pregnant women in the field 
can be a major confidence boost, Binning 


possible.” 


adds. When she started her career as a marine 
ecologist, she took a two-week course in 
Barbados, and one of the three instructors 
was pregnant at the time. “I had an image of 
a pregnant woman doing field work, and it 
didn’t seem to present any significant prob- 
lems for her,’ she says. “I always had that in 
my head. When I got pregnant, I never saw 
it as a barrier. It is so important that other 
women get these anecdotes and stories and 
see that it’s possible” 

Anda visible pregnancy can create oppor- 
tunities to connect with people from another 
culture. As a female leader of a research team, 
Dickman says, she had always been treated 
as an outsider by the women living in native 
communities near her Tanzanian field site. 
But when local women on her field staff saw 
that she was pregnant, they began to quietly 
bring her extra boiled eggs and more hot 
water to shower with. 

Some researchers find that their attitude 
to risk changes as a result of being preg- 
nant. While driving through a game reserve 
in Kenya one night during her pregnancy, 
Pobiner and a colleague almost hit an ele- 
phant on a dirt road. The near miss jolted 
her into realizing that she could never again 
afford to be as casual as she once was about 
health and safety in the field. “I thought, 
‘Wow, this was scary — it’s not just me I have 
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to worry about any more,” she says. 


STAND UP FOR YOURSELF 

Pregnancy can be a learning experience that 
extends beyond the actual physical condition, 
Cobb says. She was three months pregnant 
in November 2006 when she travelled from 
the United States to the Bahamas to consult 
ona coral-reef filming project. On her physi- 
cian’s advice, she avoided scuba diving with 
the team. But she boarded the boat every 
morning to offer advice and support. 

On the second day, choppy seas slammed 
the boat up and down, and Cobb worried 
that the impact might be harming the fetus. 
Although she asked the boat captain to slow 
down, the ride continued to be rough, and 
she was uneasy that night and beyond. 

Cobb’s son was born healthy and at full 
term the following spring. But the experi- 
ence taught her the value of standing up for 
herself and of discussing concerns with team 
members before setting out on an expedition. 
She now talks to her field teams about speak- 
ing up and looking out for each other before 
they charge ahead on a precious field day. “I 
explicitly say, “The field rule is that we are all 
in this together and there are natural risks to 
a field expedition, so let’s go through them 
one by one. You need to say, ‘I feel uncom- 
fortable’ or ‘I feel unsafe’” she says. “You need 
to be your own advocate.” m 


Emily Sohn is a freelance journalist in 
Minneapolis, Minnesota. 
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GENDER DIFFERENCES 


Language matters 


Two studies highlight yet another set of 
differences between female and male 
researchers. One suggests that messages 
that link career success to ‘brilliance’ in 
science can discourage some women 
from pursuing certain career paths or 
education opportunities. The other finds 
that women are more likely than men to 
offer ‘honorary authorships’ to scientists 
who might not deserve the accolade 

— acourtesy that risks obscuring the 
magnitude of their own contributions. 

In the first study (L. Bian et al. J. Exp. 
Soc. Psychol. http://dx.doi.org/ doi. 
org/10.1016/j.jesp.2017.11.006; 2018), 
researchers surveyed nearly 
200 undergraduates about their interest 
in hypothetical internships and in 
studying certain university subjects. 
Consistently, women were less keen about 
the possibilities when the descriptions 
emphasized the importance of brilliance 
by asking, for example, for an “intellectual 
firecracker” with a “sharp, penetrating 
mind”. But when descriptions of the same 
options used language such as “great 
focus and determination” — highlighting 
the importance of hard work and 
dedication — womens interest grew 
significantly. Conversely, men were 
generally more interested when 
descriptions emphasized intelligence 
over effort. 

The gender difference could have 
real consequences for students and 
researchers, says lead author Lin Bian, 

a psychologist at Stanford University in 
California. “Women are not motivated to 
pursue fields or jobs that are perceived as 
requiring intellectual talent or brilliance,’ 
she says; rather, she thinks that women 
are more likely to gravitate towards a field 
when scientists emphasize other keys to 
success in it, including hard work. “It’s 
important to de-emphasize the role of 
brilliance in achieving success.” 

Ina second study (E. A. Fong and A. W. 
Wilhite PLoS ONE 12, e0187394; 2017), 
researchers at the University of Alabama 
in Hunstville looked at survey responses 
from more than 12,000 US scholars from 
a wide range of disciplines. Overall, 35.5% 
of respondents reported giving at least one 
‘honorary authorship’ to a researcher who 
contributed little to the paper. Women 
were 38% more likely than men to have 
felt obliged to give honorary authorships. 
“Female researchers’, the authors conclude, 
“may be less able to resist pressure to add 
honorary authors because women are 
underrepresented in faculty leadership and 
administrative positions in academia and 
lack political power.” = 
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Ua SCIENCE FICTION 


BY ERIC LEWIS 


he two thieves stood before the 
T giant vault, fresh out of ideas. 

“Well?” Braygin prodded, 
“what do you think?” 

Tesca shrugged. “I think the 
Regent's not going to be pleased” 

“The Regent's never pleased. When 
he hears about this he'll be furious.” 
Two of the vault’s magnetic locks 
hung open; those had been easy. But 
the third... Braygin kicked a box of 
gear at their feet. “Are you sure you've 
tried everything, little man?” 

“Everything! Electronic picks, 
lockout bypasses — even nanoexplo- 
sives didn’t make a scratch. I’ve never 
seen a system this well guarded.” 

“Hmm. Guess that means we're in 
the right place,’ said Braygin, wiping 
a bead of sweat from the back of his 
neck. “Those rumours about Admi- 
ral Gnossi hiding her money in the 
gas oceans of Denovia must’ve been hog- 
wash. But how do we get in there?” 

Tesca fidgeted in his cumbersome 
spacesuit. “Whatever we do, itd better be 
fast — security system comes back online in 
15 minutes, and we'll need 5 of those to get 
back to the ship and blast off? 

“Damn. To come so close!” Braygin 
slammed a fist against the plasteel face of 
the vault. He could almost taste the loot just 
out of reach. After months of bribing and 
snooping out the location of the asteroid 
base, they'd burgled deep inside to rob the 
Regent's biggest rival blind. Even a small 
share of whatever they stole could buy them 
each a small moon to retire on, but it seemed 
that the vault had defeated even Tesca’s 
technocriminal talents. “There's absolutely 
nothing you haven't tried?” 

“Nope, nothing. Erm, well...” 

“Erm, well what?” 

Tesca looked away. “I wasn’t gonna say 
anything —” 

“Say it anyway!” 

Tesca knelt and rooted through the box, 
tossing the various bits of equipment that'd 
failed them about the dim chamber. He rose 
holding an awkward, cobbled-together mass 
of wires and metal. An antimatter symbol 
emblazoned on the side screamed warning. 
“Tt’s just a prototype...” 

“A prototype what?” 

“A portable spatial-displacement-field 
generator.” 


DECOY 


The score of a lifetime. 


“Spatial ... what, like in the engines that 
move starships?” 

“Well, technically, it’s not the ship that 
moves, it’s space itself —” 

“Whatever,” said Braygin. “You mean you 
built a handheld version? Incredible! You 
could ‘displace’ us right through the door! 
Why didn’t you start with that?” 

“Like I said, it’s a prototype... also, the 
parts are highly illegal” 

“Tllegal? Tesca, we're breaking into Admiral 
Gnossi’ vault. Just use the gadget so we can 
grab the cash and get outta here” 

“But it’s too dangerous! Without a full scan 
we've no idea what's on the other side. There 
could be guardbots, automated defences —” 

Braygin clapped his partner on the 
shoulder. “Look, this is the score ofa lifetime 
—T'll take the risk. Unless you want to be the 
one to bring the Regent the bad news?” 

“,.-all right. Gimme a second to boot it up” 

“Ten minutes and counting.” 

While Tesca worked the controls of the 
device, Braygin again eyed the great circu- 
lar door, licking his lips with anticipation. 
“This is it. It’s finally happening, after all 
these years. My big payoff. No more penny- 
ante con jobs, no more rackets. No more 
bowing and scraping before thugs like the 

Regent...” He turned 
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pretty simple. You set your field 
radius, set your displacement vector 
— say, three metres straight ahead for 
this door — then hit go. Only enough 
juice for a couple short jumps so we 
best go together... there, all set.” 
Tesca put the device down and went 
to retrieve his helmet — the atmos- 
phere in the tunnels was thin but 
breathable and theyd quickly set the 
protective gear aside. “Better suit up 
all the way, just in case — oof!” 

While Tesca’s back was turned, 
Braygin shoved him hard, and the 
low-power grav plating sent him 
tumbling. Braygin grabbed the 
displacement device and caressed 
it greedily. “Sorry Tesca, like I said, 
this is the score of a lifetime, and I 
find I ain't willing to split it. Nothing 
personal; youd do the same in my 
place if you had any brains for busi- 
ness. So long, little man!” 

“No, Braygin, don't ...!” Tesca 
watched as Braygin was enveloped in a blue 
orb, then winked out of existence along 
with a little section of tunnel floor hed been 
standing on. Tesca scrambled for his comm 
unit. “Braygin! Are you there? Braygin you 
lying cheat, talk to me!” Nothing. 

After a few stunned seconds Tesca heard a 
low bang emanate from the vault door. From 
the other side it seemed. Then another, 
much weaker. Then silence. “Braygin?” He 
rummaged through his box of gear for a 
data pad and tried to patch into Braygin’s 
suit camera. Although the signal was weak 
through the insulating plasteel, he managed 
to pick up an image, but no sound. The 
picture was rotating, slowly. He saw the 
other side of the door, then a wall of rock. A 
gloved fist drifted across the camera’ field 
of view, and when it passed Tesca saw only 
a field of stars, stark against the blackness 
of open space. 

“Huh,” Tesca muttered to the empty cham- 
ber, glancing at Braygin’s helmet left forgot- 
ten on the floor. “I guess Admiral Gnossi 
does keep her money on Denovia after all” 

Tesca gathered up his most precious gear 
and scuttled back to the ship. No, the Regent 
was most definitely not going to be pleased. m 


Eric Lewis is an organic chemist still 
learning how to be a person again after 
surviving graduate school. His stories have 
appeared in Electric Spec, Bards and Sages 
Quarterly and other online venues. 
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